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ABSTRACT 


Digital  watermarking  is  a  highly  evolving  field,  whieh  involves  the  embedding  of 
a  eertain  kind  of  information  under  a  digital  objeet  (image,  video,  audio)  for  the  purpose 
of  eopyright  proteetion.  Both  the  image  and  the  watermark  are  most  frequently  translated 
into  a  transform  domain  where  the  embedding  takes  plaee.  The  seleetion  of  both  the 
transform  domain  and  the  partieular  algorithm  that  is  used  for  the  embedding  of  the 
watermark,  depend  heavily  on  the  applieation.  One  of  the  most  widely  used  transform 
domains  for  watermarking  of  still  digital  images  is  the  Diserete  Cosine  Transform 
domain.  The  reason  is  that  the  Diserete  Cosine  Transform  is  a  part  of  the  JPEG  standard, 
whieh  in  turn  is  widely  used  for  storage  of  digital  images.  In  our  researeh  we  propose  a 
unique  method  for  DCT-based  image  watermarking.  In  an  effort  to  aehieve  robustness  to 
eropping  and  JPEG  eompression  we  have  developed  an  algorithm  for  rating  the  8x8 
bloeks  of  the  image  DCT  eoeffieients  taking  into  aeeount  their  embedding  eapaeity  and 
their  spatial  loeation  within  the  image.  Our  experiments  show  that  the  proposed  seheme 
offers  adequate  transpareney,  and  works  exeeptionally  well  against  eropping  while  at  the 
same  time  maintains  suffieient  robustness  to  JPEG  eompression. 
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EXECUTIVE  SUMMARY 


Watermarking  is  a  method  of  providing  protection  of  intellectual  property  in 
digital  multimedia,  and  is  based  on  hiding  a  digital  signature  within  the  data.  With  this 
signature  one  can  identify  the  proprietor  of  a  certain  set  of  data  and  thus  protect  her/his 
intellectual  property.  In  order  for  the  watermarking  to  be  dependable  it  is  imperative  that 
it  has  certain  characteristics.  The  most  important  of  these  are:  transparency  of  the 
watermark  (it  should  be  imperceptible  to  the  Human  Visual  System),  and  robustness 
against  common  tampering  with  the  image.  This  tampering  may  include  JPEG 
compression,  or  cropping.  With  our  work  we  provide  a  technique  that  gives  satisfactory 
results  in  terms  of  transparency  and  robustness  against  JPEG  compression  and  cropping. 
The  new  feature  in  our  work  has  to  do  with  the  method  we  use  for  the  matching  of  the 
image  blocks  and  the  watermark  coefficients  that  are  embedded  in  each  block. 

The  embedding  takes  place  in  the  DCT  domain,  which  is  also  used  by  the  JPEG 
standard,  and  allows  for  the  exploitation  of  the  domain's  particular  characteristics  and  the 
achievement  of  watermark  transparency.  Both  the  watermark  and  the  image  are  DCT 
transformed.  We  have  developed  a  method  for  rating  the  8x8  blocks  of  the  DCT  of  the 
image  according  to  their  Priority  Coefficient  (PC),  which  is  a  measure  of  their  embedding 
capacity  and  their  resistance  to  cropping. 

Eor  each  8x8  block  of  the  DCT  coefficients  of  the  image,  we  calculate  the 
Complexity  Eactor  (CE),  a  novel  metric  for  measuring  the  capacity  of  each  block  to 
receive  watermark  coefficients.  We  know  that  in  the  areas  of  the  image  where  we  have 
more  “action”  we  can  embed  more  information  imperceptibly.  In  the  literature  there  have 
been  attempts  to  use  the  variance  of  each  8x8  block  of  the  image  as  a  measure  of 
imperceptibility  after  watermark  embedding.  We  show  that  the  Complexity  Eactor  as  a 
capacity  metric  is  a  more  accurate  approach  since  the  variance  of  the  image  blocks  alone, 
does  not  necessarily  manifest  the  actual  visual  properties  of  the  particular  spatial  section 
of  the  image. 

Additionally,  for  each  block  of  the  cover  image  we  calculate  the  Center  of  Interest 
Proximity  Eactor  (CIPE),  which  is  a  measure  of  significance  of  each  8x8  block  with 


XV 


respect  to  cropping  resistance.  We  first  determine  the  Euclidean  distance  r,  between  the 
center  of  the  block,  and  the  Center  of  Interest  (Cl).  In  our  experiments  we  assumed  that 
the  Cl  is  the  center  of  the  image.  The  Euclidean  distance  r,  is  then  normalized  over  the 
diagonal  (i.e.  the  maximum  possible  distance  within  the  image)  to  produce  a  normalized 
value  rnorm.  This  normalized  distance  is  then  processed  by  a  transformer  with 
characteristic  function/ 

1  _  2  1 

/ {rnorm)  = - tan“'  {k  •  {rnorm - ))  H —  , 

TT  3  2 

to  result  in  the  CIPE  (CIPF=/(rnorm)).  The  distribution  of  the  CIPF  over  the  8x8  blocks 
of  a  256x256  image  can  be  depicted  in  figure  1. 


Figure  1.  Distribution  of  the  CIPF  over  the  8x8  blocks  of  a  256x256  image  with 

k=15. 


The  CF  of  each  8x8  block  is  scaled  by  the  CIPF  to  produce  a  Priority  Coefficient 
(PC),  which  is  attached  to  the  block  and  contains  all  the  information  that  is  required  for 
its  rating.  The  blocks  are  now  sorted  by  descending  order  of  their  PC. 

The  DCT  coefficients  of  the  watermark  are  sorted  according  to  magnitude  and 
divided  into  m  groups  of  descending  magnitude  with  equal  number  of  elements.  We  then 
form  embedding  sets  of  coefficients.  Each  set  contains  m  coefficients,  one  from  each 
group.  By  this  scheme,  we 
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•  embed  the  largest  coeffleients  in  the  bloeks  with  the  larger  capaeity  thus,  ensuring 
transparency, 

•  avoid  block  saturation,  which  would  very  likely  occur  if  only  large  coefficients  were 
embedded  in  one  block,  and, 

•  protect  the  largest  coefficients  (which  are  the  most  important  ones)  by  embedding 
them  to  the  blocks  which  are  more  unlikely  to  be  cropped. 

The  sets  are  then  embedded  into  m  frequency  coefficients  of  the  8x8  DCT  image 
blocks.  Embedding  in  the  lowest  frequencies  allows  for  higher  robustness  of  the 
watermark  against  JPEG  compression,  since  these  coefficients  are  the  least  affected  by 
the  quantization  process.  However,  the  lower  frequencies  are  the  most  perceptible  ones, 
but  we  manage  to  compensate  for  the  latter,  by  appropriately  adjusting  a  weighting  factor 
a. 

The  decoder  works  in  reverse  order  and  requires  both  the  original  image  and  the 
watermark.  The  level  of  detection  is  based  on  the  correlation  coefficient  p,  which  is  given 
by 


P  = 


j)Wr{i,  j) 
'  J 


ZZltT(/,i)l^  -ZZltTffi,;)]^ 
I  J  t  J 


and  is  a  measure  of  similarity  between  the  watermark  W  and  the  extracted  pattern  Wr. 
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I.  INTRODUCTION 


With  the  recent  developments  in  digital  communications  and  digital  signal 
processing,  and  the  expansion  of  the  Internet,  the  proliferation  of  digital  material  (audio, 
images,  video)  has  become  extremely  easy.  The  possible  implications  of  this  situation 
include  the  unauthorized  distribution  of  such  material  with  the  purpose  of  making  illegal 
profit  or  otherwise  damaging  the  legal  owner.  Inevitably  the  business  world  and  the 
authorities  have  expressed  great  concern  over  this  issue,  and  as  a  result,  the  scientific 
community  has  become  extremely  active  trying  to  provide  techniques  for  copyright 
protection  of  digital  material. 

Towards  this  direction  several  types  of  secure  communication  methods  are  being 
explored  addressing  different  aspects  of  the  problem.  These  methods  are  either  evolution 
of  previously  discovered  techniques  (types  of  encryption  date  back  to  the  Roman  era)  or 
innovations  that  are  dictated  by  the  recent  technologic  developments. 

Watermarking  is  a  method  of  providing  protection  of  intellectual  property  in 
digital  multimedia,  and  is  based  in  principle  on  hiding  a  digital  signature  (not  to  be 
confused  with  the  term  signature  as  used  in  cryptography)  within  the  data.  With  this 
signature  one  can  identify  the  proprietor  of  a  certain  set  of  data  and  thus  protect  her/his 
intellectual  property. 

A.  PURPOSE 

The  purpose  of  this  research  is  to  investigate  the  application  of  the  Discrete 
Cosine  Transform  in  Digital  Watermarking. 

In  this  thesis  we  deal  with  watermarks  for  digital  images.  In  order  for  the 
watermarking  to  be  dependable  it  is  imperative  that  it  has  certain  characteristics.  The 
most  important  of  these  are:  imperceptibility  of  the  watermark  to  human  eye,  and 
robustness  against  innocent  or  malicious  tampering  with  the  image.  Among  the  most 
common  ways  of  tampering  with  an  image  are:  cropping,  JPEG  compression,  resizing, 
filtering  etc.  It  is  these  characteristics  that  dictate  the  continuous  research  on  the  field  for 
the  development  of  a  robust  scheme. 
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In  general  a  watermarking  teehnique  involves  the  transformation  of  the  image  to  a 
transform  domain  (FFT,  DCT,  DWT),  if  other  than  the  spaee  domain,  and  the  embedding 
of  the  watermark  eoeffieients  on  some  or  all  of  the  image  eoeffieients.  The  selection  of 
the  embedding  domain  has  to  do  with  the  specific  characteristics  we  want  to  exploit.  The 
DCT  domain  in  particular,  is  very  popular  in  the  watermarking  community.  The  DCT  is  a 
part  of  the  JPEG  standard,  and  JPEG  is  in  turn  a  very  widely  used  image  compression 
technique.  By  embedding  the  watermark  in  the  DCT  domain  we  can  therefore  create 
embedding  schemes  that  are  particularly  robust  against  JPEG  compression. 

Through  the  course  of  this  research  a  considerable  amount  of  relevant  work  was 
examined  and  evaluated  in  terms  of  their  results.  Part  of  this  work  served  as  the  basis  for 
the  development  of  our  testing  platforms.  Starting  from  basic  principles  we  have 
developed  a  complete  watermarking  scheme.  Our  scheme  has  been  tested  against 
different  attacks  and  proved  to  be  adequately  transparent  and  robust.  Additionally  it  has 
been  tested  for  different  embedding  parameters  and  results  have  been  produced  and 
evaluated.  Slight  variations  of  the  basic  algorithm  have  also  been  developed  and 
investigated  in  an  effort  to  reach  better  results. 

In  this  thesis  we  present  a  unique  method  for  rating  the  8x8  blocks  of  the  image 
DCT  coefficients  according  to  their  embedding  capacity.  Eurthermore,  an  algorithm  has 
been  developed  for  determining  the  watermark  coefficients  that  are  embedded  in  each 
block.  Our  goal  was  to  achieve  maximum  transparency  and  robustness  against  cropping 
and  compression  at  the  same  time. 

B,  RESEARCH  QUESTIONS 

There  are  a  number  of  research  questions  that  we  strive  to  answer  in  this  thesis. 

Eirstly,  we  attempt  to  analyze  the  Discrete  Cosine  Transform  and  its  potentials  as 
a  watermarking  method.  We  investigate  the  watermark  characteristics  that  affect  the 
performance  of  a  watermarking  scheme  and  also  the  arguments  for  supporting  perceptual 
or  random  watermarks. 

Since  in  almost  every  watermarking  transparency  is  paramount,  we  discuss  the 
parameters  that  may  be  used  for  determining  the  capacity  of  each  image  block  following 

ideas  that  have  been  suggested  in  the  literature. 
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Identifying  the  factors  that  affect  the  quality  of  a  watermarking  scheme  when 
under  cropping  or  JPEG  compression  attacks  was  one  of  the  basic  elements  of  our 
research.  The  result  was  the  development  of  the  new  algorithm  that  is  proposed  here. 
Consequently,  the  evaluation  of  the  robustness  of  the  proposed  algorithm  under  attack 
became  also  one  of  the  primary  objectives. 

C.  THESIS  OUTLINE 

This  thesis  is  organized  as  follows: 

Chapter  II  provides  the  background  required  for  the  novice  in  the  field.  After  a 
brief  historic  overview,  Digital  Watermarking  is  identified  among  other  relevant 
technologies  and  the  lines  between  these  technologies  are  drawn.  The  needs  that  dictated 
the  development  of  this  technology  are  explained  and  also  the  requirements  of  a  Digital 
Watermark  that  stem  from  these  needs  are  reviewed.  Definitions  for  terms  and  concepts 
pertaining  specifically  to  Watermarking  are  given  and  they  serve  as  a  tool  for  better 
understanding  the  different  approaches.  Finally  the  different  watermarking  techniques 
that  have  been  developed  are  reviewed  with  emphasis  given  in  comprehending  the 
principal  differences  between  them.  Brief  examples  of  recent  research  work  are  given  in 
order  to  support  our  arguments. 

Chapter  III  involves  a  more  technical  insight  of  the  technology,  and  the 
mathematical  tools  necessary  for  comprehension  of  our  research  are  presented.  In  that 
context  the  Discrete  Cosine  Transform  (DCT)  is  analyzed  and  its  connection  to  the  JPEG 
standard  is  discussed.  The  JPEG  standard  is  reviewed  and  all  is  elements  namely  the 
DCT,  quantization,  and  encoding  are  explained. 

In  Chapter  IV  the  train  of  thought  that  led  to  the  development  of  the  new 
algorithm  is  shown.  As  our  reasoning  progresses,  a  step-by-step  implementation  of  a  new 
algorithm  is  revealed,  and  the  way  we  attacked  the  problem  is  analyzed.  This  chapter  is 
divided  into  three  sections.  In  the  first  section  new  terms  and  concepts  are  introduced  and 
explained.  In  the  next  section,  we  propose  a  new  algorithm  that  deals  with  the 
transparency  problem  and  offers  sufficient  robustness  against  cropping  and  JPEG 
compression.  Finally  we  offer  a  description  of  the  Watermark  recovery  process  that  was 
used  in  the  proposed  scheme. 
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Throughout  our  research  several  schemes  were  tried  and  evaluated.  These 
schemes  are  presented  in  Chapter  V  regardless  of  their  effectiveness  because  they  can  be 
the  basis  of  future  work.  Finally  the  algorithm  that  was  used  in  certain  key  elements  of 
our  scheme  is  analyzed. 

Chapter  VI  presents  experimental  results  validating  the  arguments  in  Chapter  IV 
and  V.  We  start  from  the  images  and  the  watermarks  that  were  used,  and  the  reasons  why 
we  chose  these  in  particular.  The  results  of  our  experiments  are  collectively  presented 
here.  We  made  an  effort  to  present  the  results  in  such  a  way  that  they  would  better 
support  the  conclusions  of  the  next  chapter. 

Finally  in  Chapter  VII  our  work  is  briefly  summarized  and  conclusions  following 
the  experimental  results  are  made.  Also  we  make  suggestions  for  possible  future  work 
based  on  this  material. 

D.  EXPECTED  BENEFITS  OF  THE  THESIS 

Digital  watermarking  is  a  research  area  still  being  under  exploration.  None  of  the 
methods  proposed  so  far  has  yet  dominated,  while  the  market  is  still  in  need  of  a 
dependable  scheme  that  will  provide  watermarking  robustness.  It  is  yet  not  known  if  the 
development  of  a  composite  watermarking  algorithm  that  will  be  used  for  different 
applications  is  feasible.  So  far  it  appears  that  even  for  objects  of  the  same  data  type  the 
watermarking  algorithms  that  have  been  developed,  seem  to  address  very  specific 
problems  (for  example  in  digital  image  watermarking  the  DCT  based  watermarks  were 
primarily  used  to  address  the  problem  of  JPEG  compression).  Towards  this  direction 
researchers  all  over  strive  to  make  all  the  necessary  steps  that  will  lead  to  a  complete, 
dependable  watermarking  algorithm. 

With  our  research  we  try  to  investigate  how  the  different  watermarking 
parameters  affect  the  quality  of  our  product.  The  issue  of  embedding  the  watermarks  in 
selected  image  blocks,  that  allow  imperceptible  embedding  is  also  addressed.  Finally,  we 
propose  a  new  algorithm  that  may  serve  as  the  basis  for  further  research  in  the  field.  We 
hope  that  this  research  contributes  towards  the  direction  of  developing  a  composite  image 
that  addresses  collectively  all  the  possible  attacks. 
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II.  BACKGROUND  ON  DIGITAL  WATERMARKING 


A,  HISTORIC  REVIEW 

The  problem  of  aehieving  hidden  communieation  between  two  parties  has  been 
investigated  for  thousands  of  years.  One  eould  safely  assume  that  from  the  moment 
mankind  formed  organized  military  groups  that  were  engaged  in  wars  of  any  extent,  the 
need  for  secure  communications  between  members  of  the  same  group  was  probably 
experienced.  There  have  been  mainly  two  approaches  towards  a  solution;  cryptography 
and  steganography.  Both  words  are  derived  from  Greek  (cryptography:  Kpvmog 
(=hidden)  +  ypdcpeiv  (=writing),  steganography:  aisyavog  (=protected)  +  ypdfeiv).  Their 
distinction  is  based  on  the  following:  cryptography  is  a  way  of  communication,  where  the 
information  to  be  secured  is  scrambled  by  the  use  of  certain  code,  in  a  way  that  a  third 
party,  without  the  code,  would  be  unable  to  retrieve  the  information;  steganography  on 
the  other  hand,  is  trying  to  achieve  secure  communication  by  hiding  the  existence  of  the 
message. 

There  is  written  evidence  that  secure  communication  techniques  were  exercised 
from  as  early  as  the  years  of  Homer.  The  most  frequently  cited  evidence  though,  is  in  the 
descriptions  of  the  Greek  historian  Herodotus  of  Halicarnassus  (440  BC).  He  states  that  a 
slave  was  sent  by  his  master,  Histiaeus,  to  Aristagoras  the  ruler  of  the  city  of  Miletus. 
The  slave  was  carrying  a  message  for  Aristagoras  tattooed  on  his  scalp.  After  tattooing 
the  message  he  let  his  hair  grow  back  again.  Only  when  he  had  safely  traveled  to  Miletus 
did  the  slave  shave  his  head  to  reveal  the  message  to  Aristagoras  encouraging  him  to 
revolt  against  the  Persian  King. 

Aeneas  the  Tactician  of  Greece  in  one  of  his  earliest  books  on  military  science. 
On  the  Defense  of  Fortified  places,  described  as  early  as  the  fourth  century,  a  system  of 
cryptography.  The  Caesar  Cipher  attributed  to  the  Roman  emperor  Julius  Caesar  (lOOBC 
-  44BC)  was  used  for  the  communication  between  him  and  his  generals.  It  was  based  on 
shifting  each  letter  of  the  communicated  text  by  a  certain  number  of  positions  in  the 
alphabet.  The  amount  of  shifting  was  known  only  to  him  and  to  his  generals.  For 
everybody  else  the  message  had  absolutely  no  meaning.  Petitcolas  et  al.  in  his  work 
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Information  Hiding  -  A  Survey  ([1]),  does  a  considerable  research  on  the  use  of  secure 
communication  techniques  throughout  history.  Among  others  he  mentions  that  the  head 
shaving  technique  that  was  used  by  Histiaeus  back  in  the  classical  Greece  was  also  used 
by  German  spies  in  the  beginning  of  the  twentieth  century. 

As  Ryan  describes  in  [2]  the  Russian  failure  at  Tannenberg  in  August  of  1914  saw 
the  complete  destruction  of  two  Russian  armies  by  a  single  German  army  half  their 
combined  size.  This  decisive  victory  directly  resulted  from  the  fact  that  the  Russian 
communications  were  compromised.  The  Russians  had  failed  to  distribute  the  military 
ciphers  and  their  keys  making  it  impossible  for  the  two  neighboring  armies  to  securely 
communicate.  All  the  Russian  communications  as  the  battle  progressed  were  in  the  clear 
and  therefore  the  Germans  knew  exactly  what  the  Russian  plans  were,  sometimes  even 
before  the  Russian  had  received  the  orders  by  their  command.  The  result,  as  Ryan  clearly 
puts  it,  was  that  in  the  end,  30,000  Russians  were  killed  or  missing,  100,000  were 
captured,  one  of  the  two  Russian  armies  was  devastated  and  one  simply  ceased  to  exist, 
all  at  the  guns  of  the  smaller  but  more  mobile  German  army  with  its  infinitely  more 
secure  communications. 

In  the  same  work  the  author  reveals  that  although  the  Japanese  policy  stressed  the 
importance  of  communications  security,  their  practices  and  procedures  implementing  that 
security  were  slipshod.  Admiral  Nimitz,  thanks  to  the  American  cryptanalysis,  was  in 
hold  of  all  the  information  that  the  captains  of  the  Japanese  ships  knew  about  the  battle  of 
Midway.  The  advantage  of  surprise  that  Yamamoto  depended  upon  was  lost  due  to 
American  cryptanalysts,  and  this  cost  the  Japanese  the  battle,  which  turned  the  tide  of  the 
war. 

Addressing  specifically  the  watermarking  history,  we  know  that  paper 
watermarks  appeared  in  the  art  of  hand  papermaking  nearly  700  years  ago.  According  to 
Hartung  and  Kutter  ([3]),  the  oldest  watermarked  paper  found  in  archives  dates  back  to 
1292  and  has  its  origin  in  Fabriano,  Italy,  which  is  considered  the  birthplace  of 
watermarks.  Thereafter,  paper  watermarking  spread  quickly  all  over  Europe  and  beyond 
its  use  as  a  security  feature,  served  also  as  an  indication  for  paper  format  and  quality. 
Paper  watermarks  were  also  used  to  date  and  authenticate  paper. 
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Paper  watermarking  is  still  used,  and  is  one  of  the  major  security  measures  in 
today's  banknotes.  The  EURO  (€)  that  was  introduced  only  a  few  months  ago  among  the 
European  Union  Countries  was  designed  to  have  a  watermark  as  one  of  its  security 
features.  In  figure  2  we  can  see  the  watermark  in  the  5€  banknote. 

Cox  et  al.  in  their  recent  book  on  Digital  Watermarking  ([4])  make  a  reference  to 
the  book  "The  Codebreakers",  by  Kahn,  where  there  are  stories  of  information  hiding 
which  are  more  relevant  to  watermarking.  It  is  described  in  particular,  that  in  the  book 
"Hypnerotomachia  Poliphili",  which  was  anonymously  published  in  1499,  there  was  a 
secret  message  hidden.  Putting  together  the  first  letters  of  each  chapter  you  would  form 
the  phrase  "Poliam  Prater  Pranciscus  Columna  Peramavit",  which  means  "Pather 
Prancesco  Columna  loves  Polia". 


Pigure  2.  The  5  EURO  banknote  and  its  watermark  (copied  from  the  European 
Central  Bank  site  for  the  new  currency  at  http://www.euro.ecb.int/I 

In  the  same  book  [4],  the  authors  mention  a  story  that  takes  place  in  the  mid 
twentieth  century  and  involves  the  use  of  a  watermark  very  similarly  to  the  way 
watermarks  are  used  now,  in  the  digital  world.  Specifically,  in  1954,  Emil  Hembrook,  of 
the  Muzak  Corporation,  inserted  an  identification  code  in  music  by  intermittently 
applying  a  notch  filter  centered  at  IKHz.  He  used  the  Morse  coding,  and  therefore  by  the 
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absence  of  energy  that  the  filter  caused,  and  by  its  duration,  one  could  identify  the  hidden 
information.  It  is  interesting  to  note  that  this  invention  is  described  by  the  US  Patent  in 
1961,  as  an  invention  that  makes  possible  the  identification  of  the  origin  of  a  music 
presentation,  therefore,  preventing  piracy. 

There  is  clearly  a  connection  between  paper  watermarks,  steganography,  and 
digital  watermarking.  In  fact,  paper  watermarks  in  banknotes  probably  inspired  the  first 
use  of  the  term  watermarking  in  the  context  of  digital  data  ([3]). 

It  is  debatable  who  were  the  first  to  introduce  the  term  digital  watermarking  ([4], 
[3]).  What  appears  to  be  more  accurate  is  the  Cox  version,  which  states  that  the  first  to 
use  the  term  were  Komatsu  and  Tominaga,  in  1988.  It  took  however  a  few  more  years 
until  1995/1996  before  watermarking  received  remarkable  attention.  Since  then,  digital 
watermarking  has  evolved  very  quickly,  something  that  can  be  verified  by  the  amount  of 
papers  published  on  the  subject. 

Nowadays  many  corporations  and  institutions  are  active  in  the  field.  As  an 
example  we  mention  the  International  Standard's  Organization  (ISO)  taking  interest  in  the 
technology  in  the  context  of  designing  advance  MPEG  standard.  The  DVD  and  audio  CD 
industries  also  strive  to  produce  secure  watermarks.  The  significance  of  the  research 
going  on  can  be  perceived  by  the  example  of  the  SDMI  foundation  and  Verance 
Corporation  that  threatened  to  bring  a  lawsuit  against  a  scientific  team  that  participated  in 
a  "public  challenge",  broke  their  algorithm,  and  attempted  to  publish  the  results  in  a  paper 
titled  "Reading  Between  the  Lines:  Lessons  from  the  SDMI  Challenge"  by  Craver  et  al. 

B,  GENERAL  CONTEXT  OF  INFORMATION  HIDING 

In  the  literature  there  have  been  several  attempts  for  categorizing  the  different 
methods  of  secure  communications.  These  attempts  do  not  always  agree  and  in  some 
cases  may  even  be  conflicting.  The  terms  used  are  general  by  nature,  and  thus  they  are 
frequently  overlapping.  We  will  try  to  describe  the  idea  behind  these  terms  and  give  an 
overview  of  how  they  are  related  to  each  other,  but  we  will  avoid  any  attempt  to  form  a 
strictly  defined  Secure  Communications’  tree. 

Encryption  was  up  to  a  few  years  ago  the  only  available  means  for  protection.  It 
involves  the  scrambling  of  the  data  with  a  key,  which  makes  it  difficult  (depending  on  the 
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quality  and  the  complexity  of  the  encryption  algorithm)  for  an  eavesdropper  to  gain  any 
information  on  the  content  of  the  data  that  is  being  exchanged.  A  key  (the  same  or  a 
different  one  depending  on  the  encryption  algorithm)  should  also  be  used  by  the 
authorized  recipient  in  order  for  the  message  to  be  decrypted.  The  message  on  the 
communication  channel  is,  in  this  case,  meaningless  for  those  not  in  hold  of  the  Key.  An 
unauthorized  third  party,  knowing  that  an  encrypted  message  is  being  exchanged,  should 
try  to  break  the  encryption  algorithm.  Depending  on  the  type  of  the  message  sent,  one  can 
select  the  strength  of  the  encryption  algorithm.  For  example  if  the  message  is  “attack  the 
enemy  on  January  K*,  at  01:00”  an  algorithm  that  will  not  be  broken  by  January  D*  will 
suffice.  Beyond  that  point  the  attack  has  already  commenced,  and  it  is  reasonable  to 
assume,  that  the  enemy  has  already  found  out.  So  the  type  of  the  message  determines  the 
cost  one  has  to  pay,  in  terms  of  time  and  resources,  in  creating  an  algorithm  with  the 
appropriate  strength. 

With  this  scheme  we  achieve  protection  during  the  transmission  of  the  message 
but  we  have  no  protection  whatsoever  in  a  case  where  the  contents  of  the  message  are 
publicly  available  but  their  redistribution  is  not  authorized.  An  audio  CD  for  example 
contains  information  that  is  readily  available  for  public  use  but  unauthorized  copying  of 
its  contents  is  illegal. 

1.  Information  Hiding 

A  widespread  term  describing  a  broad  area  of  secure  communication  methods  is 
Information  Hiding.  It  is  a  general  term  that  encompasses  different  kind  of  problems.  All 
of  these  problems  have  as  a  common  denominator  the  effort  to  prohibit  an  unauthorized 
third  party  from  obtaining  access  to  a  message.  Despite  our  feeling  that  Information 
Hiding  should  be  used  interchangeably  with  Communications  Security  this  is  not 
generally  the  case.  In  most  of  the  literature  Information  Hiding  is  treated  as  a  subcategory 
of  Communications  Security,  along  with  Cryptography. 

Information  Hiding  may  refer  to  either  making  the  information  imperceptible  or 
keeping  the  existence  of  the  information  secret  [4].  Petitcolas  et  al.  ([1])  describe 
Information  Hiding  as  traffic  security,  treating  it  separately  from  encryption.  According 
to  the  same  authors  this  discipline  also  includes  such  technologies  as:  spread  spectrum 
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radio  communications  (widely  used  in  the  military  in  an  effort  to  keep  seeret  the 
exehange  of  radio  transmissions),  temporary  mobile  subseriber  identifiers  (provide  to 
some  extend  loeation  privacy  to  users  of  digital  telephones),  and  anonymous  remailers 
(whieh  eoneeal  the  identity  of  the  sender  of  an  e-mail  message). 

2,  Steganography 

Steganography  is  the  art  of  trying  to  keep  eoneealed  the  very  existence  of  the 
eommunieation  ehannel.  Some  examples  of  steganographie  attempts  throughout  history 
have  been  shown  in  Section  A  above.  In  general,  steganography  falls  under  the 
Information  Hiding  root.  In  the  Information  Hiding  tree  provided  in  [I],  there  are  further 
subdivisions  of  steganography  (Linguistie,  Teehnieal)  that  we  are  not  going  to  eover. 

3.  Covert  Channels 

Covert  ehannels  are  deseribed  as  ehannels  that  were  not  designed  for  the  purpose 
of  exehanging  of  information.  The  term  is  primarily  used  in  eomputer  seeurity  and 
deseribes  the  method  that  is  used  by  programs  that  communieate  information  to 
unauthorized  parties. 

The  most  common  way  to  implement  this  idea,  is  by  inserting  a  Trojan  horse  into 
a  serviee  program.  The  user  is  normally  unsuspeeting  of  the  situation  and  when  using  the 
serviee  program  he  automatieally  leaks  information.  To  better  appreeiate  this  teehnique 
we  will  provide  a  partieular  example  of  eovert  channels:  the  storage  channels  ([5]).  In 
multiuser  systems,  the  operating  system  does  not  normally  allow  users  to  write  to  the 
same  file  at  the  same  time  in  order  to  prevent  its  possible  eorruption.  Every  file  in  use  is 
"loeked",  and  thus  any  "write"  request  from  other  programs  is  rejeeted  by  the  operating 
system.  A  eovert  ehannel  ean  signal  a  1-bit  information  by  whether  or  not  a  file  is  loeked. 
At  this  time  the  serviee  program  may  be  reading  eonfidential  data  and  the  Trojan  horse 
signals  the  data  one  bit  at  a  time  by  bloeking  or  not  an  irrelevant  predetermined  file.  The 
only  extra  requirement  for  the  implementation  of  this  teehnique  is  that  the  serviee 
program  (with  the  Trojan  horse)  and  the  unauthorized  third  party  have  a  eommon  timing 
souree. 

C.  WATERMARKING 

1.  Watermarking  in  the  Digital  World 
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The  developments  in  the  networking  teehnology  and  the  worldwide  web  have 
signifieantly  inereased  the  risk  of  piracy.  The  situation  now  has  very  much  evolved  from 
the  days  were  the  only  storage  means  was  a  tape,  and  any  kind  of  reproduction  resulted  in 
copies  that  were  degraded  versions  of  the  original  object.  Nowadays  the  multimedia  data 
are  available  on  the  Internet  in  digital  form,  which  allows  for  the  reproduction  of  exact 
copies  of  the  original.  Additionally,  copying  devices  are  quite  efficient,  and  most 
importantly,  inexpensive,  and  therefore  virtually  anyone  could  afford  its  use.  Considering 
both  these  factors  we  have  all  the  necessary  requirements  to  managing  illegal  distribution 
of  digital  multimedia  and  thus  financially  damage  the  legal  owner  of  an  object. 

As  already  explained  in  the  introduction,  cryptography  is  not  an  adequate  method 
when  it  comes  to  the  protection  of  material  that  is  publicly  available  but  its  redistribution 
is  unauthorized.  The  watermarking  technology  potentially  offers  the  solution  to  this 
problem. 

Steganography  and  watermarking  have  been  developed  based  on  the  same 
theoretical  roots,  that  we  want  to  keep  a  secret  message  hidden  from  an  intruder. 
However  there  are  conceptual  differences  between  the  two.  Firstly,  the  latter  requires 
extra  robustness  against  attacks  since  our  priority  is  to  maintain  the  integrity  of  the  secret 
message  /  watermark.  In  steganography,  on  the  other  hand,  the  assumption  is  that  there 
will  be  no  such  attacks  against  the  hidden  message  only  because  its  very  existence  is 
secret.  Any  kind  of  attack  on  the  object  carrying  the  message  is  of  no  importance  because 
it  only  serves  as  a  cover  of  our  real  intentions.  Here  only  the  secrecy  of  the 
communication  path  is  paramount.  In  watermarking,  there  are  cases  where  we  select  to 
reveal  the  existence  of  a  watermark  on  our  object  (the  intruder  knows  that  there  is  a  secret 
message  but  does  not  know  how  to  remove  it),  challenging,  in  a  way,  potential  attackers. 
However,  this  may  serve  as  a  deterrence,  since  an  intruder  might  not  select  to  attack  an 
image  knowingly  marked.  In  other  words  in  steganography  an  intruder  strives  to  detect 
the  existence  of  a  secret  communication  path  and  to  retrieve  the  hidden  information 
regardless  of  the  effect  on  the  cover  object,  while  in  watermarking  an  intruder  aims  at 
removing  the  watermark  while  at  the  same  time  maintaining  the  quality  of  the  object. 
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We  realize  that  the  developments  in  the  watermarking  technology  were  dictated 
by  the  need  for  copyright  protection  of  digital  material.  And  this  need  resulted  from  the 
developments  of  other  technological  fields  i.e.  networking,  storage  and  reproduction  of 
digital  data  etc. 

2,  Requirements 

There  are  many  different  everyday  situations  where  the  watermarking  technology 
would  be  applicable.  The  first  that  comes  to  mind  is  proof  of  ownership.  When  you 
publish  an  image  in  the  web  and  you  want  to  retain  the  copyrights,  you  need  to  have  a 
means  of  proving  your  ownership  in  a  dispute.  Registering  the  image  with  the  Office  of 
Copyrights  and  Patents  would  be  the  most  appropriate  action.  However  this  is  not  always 
what  people  do,  either  to  avoid  the  cost  involved  or  simply  to  avoid  extra  paperwork.  In 
general  you  want  to  put  a  digital  signature  in  the  object  you  own,  in  a  way  that  only  you 
can  extract  it.  Any  copy  of  the  object  would  carry  that  same  signature.  If  an  adversary 
wanted  to  steal  your  property  he  would  have  to  extract  your  watermark  from  the  object, 
and  maybe  insert  his  own  instead.  This  situation  dictates  one  of  the  properties  that  this 
specific  type  of  watermark  should  have:  robustness  against  any  kind  of  tampering  with 
the  image.  The  appropriate  watermark  should  not  be  easily  extracted  from  the  image  and 
if  it  did,  the  image  should  be  so  much  degraded  (in  terms  of  quality)  that  would  not  serve 
any  purpose  to  the  unauthorized  user. 

Next,  we  describe  a  situation  where  the  owner  of  an  object  makes  a  legal 
agreement  for  supplying  his  object  to  clients.  We  need  a  watermark  to  identify  which  of 
the  clients  broke  this  agreement  and  supplied  the  object  to  third  parties.  The  watermark  in 
this  case  serves  as  a  serial  number,  it  should  be  robust  against  attacks,  and,  at  the  same 
time,  unique  for  each  customer. 

A  different  type  of  watermark  should  be  used  for  verifying  that  a  certain  copy  of 
an  object  is  indeed  a  credible  copy  and  that  it  has  not  been  tampered  with,  in  a  manner 
that  "critically"  alters  its  contents.  There  are  tamper-proofing  techniques  that  accurately 
detect  that  an  object  has  been  tampered  with  [6].  But  these  techniques  produce  "yes-or-no 
results  to  the  question  of  tampering"  ([7])  and  therefore  they  are  not  useful  in  all  possible 
cases.  The  big  question  here  is  what  kinds  of  changes  need  to  be  detected  and  what 
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changes  are  of  no  interest  to  us.  A  eonsiderable  amount  of  researeh  has  been  eondueted  in 
the  area,  in  an  effort  to  produee  a  watermark,  whieh  would  be  fragile  to  eertain  types  of 
modifications  and  resistant  to  others.  This  would  allow  the  detection  of  particular  types  of 
tampering,  for  example  the  use  of  Adobe  Photoshop  to  add  a  non-existing  objeet  to  the 
image,  while  others  sueh  as  JPEG  compression  should  go  undeteeted. 

Other  qualities  generally  required  from  a  watermark  stem  from  the  type  of  the 
applieation.  For  example  in  digital  sound  or  images  it  would  be  preferable  for  the 
watermark  to  be  impereeptible  to  the  human  senses  (ear  or  eye)  so  that  the  quality  of  the 
marked  image  is  not  eompromised. 

3,  Terminology 

a.  Public  and  Private  Watermarking 

In  the  literature  there  have  been  several  approaehes  to  this  issue.  Petiteolas 
et  al.  in  [1]  define  as  private  watermarking  systems  those  that  require  at  least  the  original 
image  for  deeoding.  The  authors  of  the  paper  further  define  the  Type  I  and  Type  //private 
watermarking  systems.  As  Type  I  they  eharaeterize  those  systems  that  base  their 
deteetion  proeess  on  the  possibly  marked  image  and  an  exaet  eopy  of  the  original  one. 
The  Type  II  systems  on  the  other  hand,  require  also  the  watermark  for  the  deeoder.  One 
more  eategory,  the  Semiprivate  watermarks,  is  also  mentioned.  Publie  marking  requires 
neither  the  original  image  nor  the  watermark.  Consequently  it  is  a  more  ehallenging 
seheme  but  the  deeoder  results  are  expeeted  to  be  poorer  beeause  of  the  small  amount  of 
information  that  is  available  throughout  the  proeess. 

Cox  et  al.  in  [4]  seem  to  put  the  two  terms  in  a  more  general,  though  also 
more  eomplieated,  perspective.  According  to  them,  in  both  oases  the  world  oan  be 
divided  into  a  group  of  trusted  individuals,  and  the  public,  who  are  assumed  to  be 
potential  adversaries.  In  private  watermarking  the  public  has  no  access  to  the 
watermarking  data  whatsoever.  In  publie  watermarking  however,  the  publie  is  only 
allowed  to  detect  the  watermarks.  The  way  the  terms  are  used  here,  refer  to  the  seourity 
requirements  of  the  applieation.  Similarly  the  same  terms  oan  be  used  to  desoribe 
watermarking  algorithms  and  as  sueh  they  desoribe  algorithms  that  fulfill  the 
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corresponding  security  requirements.  The  authors  admit  that  in  that  sense  the  usage  of  the 
terms  public  and  private  is  ambiguous. 

We  will  follow  the  definition  given  by  Petitcolas  ([1]),  which  is  accepted 
in  most  publications.  In  many  publications  the  term  public  is  used  interchangeably  with 
blind. 

Blind  watermarking  appears  to  have  more  applications  and  to  be  much 
easier  to  use.  You  need  to  pass  only  the  tested  image  through  the  decoder.  Private 
watermarking  algorithms,  since  they  need  at  least  the  original  image,  they  have  to  be 
supported  by  higher  seeurity  requirements.  On  the  other  hand,  the  development  of  a 
private  watermarking  algorithm  should  also  be  generally  simpler  and  the  results  are 
expected  to  be  signifieantly  better. 

The  blind  watermarking  techniques  developed  so  far  do  not  seem  to  be 
adequately  effective,  but  both  subjects  are  currently  under  research  and  we  should  expect 
better  results  in  the  near  future. 

b.  Robust  and  Fragile  Watermarks 

The  term  robust  watermark  describes  those  watermarks  remain  detectable 
within  an  objeet  in  spite  of  significant  levels  of  tampering  of  all  kinds.  The  detection  of 
the  watermark  comes  down  to  the  determination  of  the  probability  that  the  watermark  is 
present  in  the  object.  In  other  words  this  is  a  measure  of  how  confident  we  are  that  the 
tested  objeet  is  indeed  marked.  Even  when  the  detector  gives  a  yes-no  answer,  in  the 
general  case,  this  results  from  the  comparison  of  the  caleulated  probability  with  a 
predetermined  threshold.  However,  when  an  object  is  tampered  with,  it  is  automatically 
modified  from  the  original,  and  in  that  sense  its  quality  is  degraded.  Whether  this 
degradation  can  be  detected  or  not  by  the  human  sensors,  is  the  question  that  needs  to  be 
asked.  Therefore,  we  can  define  some  limits  for  the  maximum  required  robustness  of  the 
embedded  watermark.  The  limits  are  set  to  the  point  where  the  object  is  subjected  to  so 
much  tampering  for  the  removal  of  the  mark  that  the  results  not  only  can  be  detected  by 
the  human  sensors,  but  also  its  quality  becomes  very  low  to  offer  any  benefit  to  an 
attacker.  In  reality  the  situation  is  mueh  more  complicated  because  for  each  different  kind 
of  tampering  the  limits  described  above  are  different.  To  exhaust  all  possible  attacks 
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(different  kinds  of  tampering)  and  thus  set  a  final  limit  that  would  cover  all  the  cases  if 
not  impossible,  it  is  not  an  easy  task. 


A  fragile  watermark  has  the  purpose  to  confirm  that  the  object  has  not 
been  tampered  with.  In  cryptography  the  same  problem  has  been  studied  extensively  and 
the  most  well  known  solution  is  the  creation  of  a  digital  signature.  In  that  sense  the 
digital  object  is  processed  through  a  Hash  Function  [5].  A  Hash  Function  "produces  a 
reduced  form  of  the  body  of  data  such  that  most  changes  to  the  data  will  also  change  the 
reduced  form  In  particular  a  cryptographic  hash  function,  uses  a  cryptographic  function 
as  part  of  the  hash  function.  The  sender  in  this  case  would  evaluate  the  hash  function  of 
the  data  and  send  both  the  data  and  the  hash  value  through  the  communication  channel.  A 
legitimate  recipient  should  be  in  hold  of  the  cryptographic  algorithm.  He  should  decrypt 
both  the  data  and  the  hash  value  and  then  pass  the  data  through  the  same  hash  function. 
By  comparing  the  computed  hash  value  with  the  value  that  was  transmitted  to  him  by  the 
sender,  he  can  verify  that  the  data  were  received  as  sent.  An  intruder  may  be  able  to 
modify  the  data,  or  the  hash,  on  their  way  over  the  channel.  However,  since  he  has  only 
access  to  encrypted  information,  it  is  unlikely  that  he  could  modify  both  in  such  a  way 
that  they  would  match  again. 

As  explained  in  one  of  the  examples  given  in  sub-section  C.2  a  watermark 
that  potentially  exhibits  selective  robustness,  generally  called  fragile  watermark,  is 
required  for  tamper-proofing  purposes.  Again,  the  development  of  this  kind  of  watermark 
faces  serious  problems.  Except  from  the  pure  implementation  issues  that  include  the 
several  different  cases  that  need  to  be  examined,  there  is  also  need  for  some  limits  to  be 
set.  These  will  define  the  cases  where  the  watermark  should  be  robust  and  the  cases 
where  it  should  be  fragile.  The  lines  are  also  in  this  case  unclear  and  therefore  difficult  to 
be  firmly  established. 

c.  Fingerprinting 

The  term  describes  the  watermarks  that  are  used  as  a  serial  number  on  the 
copies  of  an  object.  They  are  like  a  fingerprint  of  the  copy.  The  situations  that  dictate  the 
need  for  their  development  are  also  described  in  sub-section  C.2.  The  primary  qualities  of 
the  fingerprints  are  robustness  against  attacks  and  uniqueness  for  each  different  copy  of 
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the  same  objeet.  Aecording  to  [4],  fingerprinting  refers  sometimes  to  the  practice  of 
extracting  inherent  feature  vectors  that  uniquely  identify  the  content. 

D,  IMAGE  WATERMARKING  TECHNIQUES 

The  driving  force  for  the  booming  of  the  watermarking  research  was,  as  already 
explained,  partially  the  Internet  users,  who  are  in  need  to  secure  their  multimedia 
products  that  are  available  on  the  internet,  and  also  the  industry  of  musical  CDs  and 
DVDs  that  are  even  more  desperate  to  protect  their  intellectual  property  and  secure  their 
profits.  There  is  demand  for  all  kinds  of  watermarks.  In  our  research  we  will  deal  with 
digital  images.  The  amount  of  research  in  this  field  (image  watermarking)  is  larger 
compared  to  other  fields  and  this  is  partially  due  to  the  large  amount  of  digital  images 
that  are  available  in  the  Internet.  There  are  two  main  embedding  techniques  different  in 
principle:  one  that  involves  embedding  in  the  space  domain,  and  the  other  that  uses 
instead  a  transform  domain.  In  [3],  [8],  and  [9]  the  authors  provide  an  overview  of  some 
of  the  significant  work  in  digital  watermarking  involving  different  embedding 
approaches. 

1,  Space  Domain  Watermarking 

The  space  domain  techniques  are  generally  considered  more  susceptible  to  the 
various  kinds  of  attacks.  However  these  techniques  were  implemented  first,  and  there  is 
still  research  going  on  in  the  area,  though  not  as  intense  as  in  the  transform  domains. 
Space  domain  techniques  can  be  chosen  for  low  cost  schemes  requiring  low  complexity 
and  small  computational  overhead. 

The  early  space  domain  watermarking  techniques  were  not  particularly  efficient. 
One  of  the  most  primitive  ideas  was  embedding  in  the  Least  Significant  Bit  of  the  pixel 
values.  This  technique  is  generally  easy  to  detect  and  thus  not  much  sophistication  is 
required  to  remove  the  watermark.  The  space  domain  approach  has  evolved  thereafter 
and  methods  have  been  proposed  that  are  considerably  more  effective. 

In  [10]  the  authors  propose  the  "Patchwork"  method  and  the  "Texture  Block 
Coding".  In  the  former  randomly  selected  pairs  of  pixels  (a„  Pi)  are  used  to  hide  1  bit  of 
watermark.  The  value  of  a,  is  increased  by  1  and  the  value  of  Pi  is  decreased  by  1 .  For 
this  method  to  work,  some  statistical  properties  should  be  satisfied.  The  latter  involves 
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copying  one  image  texture  bloek  to  another  area  in  the  image  with  similar  texture.  For  the 
reeovery  of  the  watermark  the  autoeorrelation  funetion  is  computed.  This  method  has 
proved  to  be  suffieiently  robust  to  several  kinds  of  distortion. 

In  [11]  another  teehnique  is  proposed.  The  authors  use  a  binary  watermark  with 
equal  numbers  of  ones  and  zeros,  whieh  has  the  same  size  as  the  original  image.  In  half 
of  the  image  pixels  a  binary  one  is  embedded  by  ehanging  the  pixel  number  by  an  integer 
value  k,  whieh  is  the  same  for  all  the  pixels.  Hypothesis  testing  is  used  for  the  watermark 
detection  and  the  method  seems  to  behave  well  in  down-sampling  followed  by  up¬ 
sampling,  and  JPEG  eompression  with  eompression  ratios  up  to  1:4. 

A  somewhat  improved  version  of  this  idea  is  proposed  in  [12].  The  image  is 
divided  into  non-overlapping  8x8  bloeks.  The  bloeks  where  the  mark  will  be  embedded 
are  seleeted  pseudorandomly.  To  eaeh  seleeted  bloek  a  pseudorandom  binary  8x8  bloek 
with  equal  number  of  ones  and  zeros  is  assigned.  To  embed  a  bit  1  the  pattern  is  added  to 
the  bloek  and  to  embed  a  zero  the  same  pattern  is  subtraeted  from  the  bloek.  Then  the 
differenee  between  the  mean  value  of  the  image  pixels  that  eorrespond  to  a  0  in  the 
pattern  is  subtraeted  from  the  mean  value  of  the  pixels  that  eorrespond  to  a  1 .  The  same 
ealeulations  are  repeated  for  the  JPEG  eompressed  counterpart  of  the  image.  If  a  1  is 
embedded  the  differenees  from  both  the  original  and  the  eompressed  version  need  to 
exeeed  a  threshold  T.  If  a  0  is  embedded  both  differenees  have  to  be  below  0.  If  this 
requirement  is  not  met  the  pattern  is  iteratively  added  or  subtraeted  until  the  eondition  is 
met.  This  method  is  partieularly  designed  for  JPEG  eompression  and  aeeording  to  the 
results  presented,  it  seems  to  provide  suffieient  robustness. 

Kutter  et  al.  ([13])  attempt  to  embed  a  watermark  in  the  spaee  domain  using  only 
the  blue  image  eomponent  in  an  RGB  eolorspaee,  in  order  to  maximize  the  watermarking 
strength  while  keeping  the  visual  artifaets  minimal. 

2,  Transform  Domain  Watermarking 

There  have  been  several  attempts  by  the  researeh  eommunity  to  investigate  the 
watermarking  performanee  in  different  transform  domains.  The  basie  benefit  from  a 
transform  domain  teehnique  is  that  by  ehoosing  a  framework  that  matehes  the  eurrent 
eompression  standards,  the  watermarking  algorithm  can  be  designed  to  avoid  embedding 
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in  the  coefficients  that  are  normally  discarded  or  severely  quantized  during  compression. 
In  this  way  we  can  ensure  robustness  to  this  particular  kind  of  compression  ([14],  [15]). 

In  [16],  to  start  from  a  rather  unusual  approach,  the  authors  use  the  Fresnel 
transform  to  provide  the  embedding  domain.  The  advantage  of  this  approach  is  that 
several  embedding  plains  exist  in  the  Fresnel  domain  according  to  the  various  distance 
parameters  thus  providing  many  embedding  channels.  This  work  seems  to  give  good 
results  against  certain  geometric  transformations  and  filtering  but  there  is  no  indication 
whatsoever  of  its  performance  against  any  type  of  compression.  In  addition,  no  follow-up 
work  has  been  observed  in  the  literature. 

In  reference  [17]  the  authors  propose  embedding  in  the  DFT  domain.  In  particular 
they  select  to  embed  the  watermark  using  the  phase  of  the  DFT  since  it  appears  to  be 
more  important  than  the  amplitude  of  the  DFT  values  for  intelligibility  of  an  image.  In 
other  references  ([18],  [19],  [20],  [21])  the  amplitude  of  the  DFT  is  also  used. 

In  reference  [22]  the  authors  propose  a  spread  spectrum  embedding  technique, 
which  uses  the  DCT  domain  as  the  embedding  domain.  Its  innovation  was  how 
communication  concepts  such  as  spread  spectrum  can  be  applied  to  watermarking,  and 
that  the  watermark  can  be  embedded  in  the  perceptually  significant  portion  of  the  image. 
In  spread  spectrum  communications,  one  transmits  a  narrowband  signal  over  a  much 
larger  bandwidth  such  that  the  signal  energy  present  in  any  frequency  is  undetectable. 
Similarly  here,  the  watermark  is  spread  over  many  frequency  bins  so  that  the  energy  in 
every  one  bin  is  very  small  and  therefore  unnoticeable.  This  idea  can  be  applied  to 
different  transform  domains.  When  the  DCT  is  used,  the  transform  is  performed  on  the 
whole  image  and  the  watermark  is  embedded  in  the  lowest  frequency  coefficients 
(excluding  the  DC  component).  As  influential  as  this  work  may  be,  it  has  not  yet 
produced  the  breakthrough  method  that  the  watermarking  community  is  expecting. 

A  different  idea  is  presented  by  Podilchuk  et  al.  in  [23],  [24].  There,  the  concept 
of  the  Just  Noticeable  Difference  (JND)  is  used.  The  JND  thresholds  have  been  used 
successfully  in  audio  compression  and  in  [24]  the  authors  were  the  first  to  introduce  the 
same  concept  to  digital  watermarking.  In  essence,  the  JND  threshold  determines  the 
maximum  level  of  distortion  that  will  be  transparent  to  the  human  visual  system  (HVS). 
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According  to  the  authors  there  are  three  different  properties  of  the  HVS  that  determine 
these  thresholds  and  need  to  be  taken  into  account  when  building  a  model:  (a)  Frequency 
sensitivity,  which  describes  the  human  eye’s  sensitivity  to  frequency  gratings  at  various 
frequencies,  and  provides  a  basic  visual  model  that  depends  only  on  viewing  conditions 
and  is  independent  of  the  content  of  the  image;  (b)  Luminance  sensitivity,  which  is  a  non¬ 
linear  function  for  the  HVS,  and  measures  the  effect  of  the  detectability  threshold  of 
noise  on  a  constant  background;  and  (c)  Contrast  masking,  which  refers  to  the 
detectability  of  one  signal  in  the  presence  of  another  signal.  An  attempt  to  incorporate  the 
JND  models  to  the  work  that  is  presented  in  this  paper  will  be  left  for  future  work.  This 
concept  is  applied  to  both  the  DCT  and  the  Wavelet  domain. 

The  authors,  Piva  et  al,  have  also  worked  on  a  DCT-based  method  that  exploits 
the  masking  characteristics  of  the  HVS  [25].  The  watermark  used  is  a  pseudorandom 
sequence  of  N  real  numbers  with  normal  distribution  and  the  method  appears  to  be 
effective  with  respect  to  JPEG  compression  median  filtering  and  some  geometric 
distortions. 

The  Wavelet  domain  appears  also  to  be  an  appealing  embedding  domain.  One 
reason  being  that  it  is  included  in  the  JPEG  2000  standards.  Therefore  wavelet-besed 
watermarking  methods  can  be  applied  to  provide  protection  against  JPEG  2000 
compression.  Also  the  wavelet  domain  can  be  used  as  a  computationally  efficient  version 
of  the  frequency  models  for  the  HVS  ([26]). 
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III.  DCT  DOMAIN  TECHNIQUES 


A,  THE  DISCRETE  COSINE  TRANSFORM 

The  Discrete  Cosine  Transform  is  a  key  element  for  JPEG  compression  and  as 
such  the  related  theory  is  important  for  our  research.  The  concept  is  well  explained  in 
reference  [27].  The  DCT  is  a  linear  transform  and  therefore  we  will  briefly  introduce  the 
linear  transforms  first. 

1.  Linear  Transforms 

Generally  in  a  linear  transformation  we  derive  a  sequence  from  a  sequence 

{  Zn  }  based  on  the  equation 


M-\ 

Wn=Y.^i-CCn,-  (3-1) 

Equation  3.1  is  referred  to  as  the  forward  transform.  The  original  sequence  can  be 
recovered  from  the  inverse 


M-\ 

%n=Y.^rPn,l-  (3.2) 

/-o 

We  can  get  the  same  results  using  a  matrix  representation 

¥  =  ^-X  (3.3) 

Z  =  '^-W,  (3.4) 

where  X  =  {Xo, =  and,  A,  B  are  MxM  matrices  with 

[A],,  j  =  a  I  j ,  [b],  j  =  - .  The  forward  and  inverse  transform  matrices  A  and  B  ,  are 

inverse  of  each  other,  and  therefore  A .  B  =  B .  A  =  /  ,  where  /  is  the  identity  matrix. 
Expanding  these  equations  in  two  dimensions  we  get  the  general  forward  linear  transform 
for  a  block  of  size  MxM 


'P 


k,l 


M-lM-l 

E  Ex, 


/=0  J=0 


(3.5) 
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A  two-dimensional  transform  is  called  separable  if  it  can  be  decomposed  into  a 
sequence  of  one-dimensional  transforms.  In  the  case  of  images  this  leads  to  a  transform 
of  the  rows,  followed  by  a  transform  of  the  columns,  or  vice  versa.  In  the  separable  case 
equation  3.5  can  also  be  represented  as 

=  (3.6) 

1=0  y=o 

For  a  matrix  representation  again  we  have 

'F  =  A-X-A^  (3.7) 

and  the  inverse 

X  =  (3.8) 

A  transform  is  called  orthonormal  if  the  inverse  of  the  transform  matrix  is  the 
same  as  its  transpose 


B  =  A"‘=A*^.  (3.9) 

Orthonormal  transforms  are  energy  preserving  or  in  other  words  the  sum  of  the  squares  of 
the  original  and  the  transformed  sequences  are  equal.  The  proof  is  in  reference  [27]  for 
the  case  of  one-dimensional  transform; 


M-\ 


’Zwf  =r'"-r  =  (AzrAz  =  /"A'"Az  (3.10) 

1=0 

For  an  orthonormal  transform  with  transformation  matrix  A ,  it  is  implied  that 


A*^  •A  =  A"'  •A  =  / 


(3.11) 


and  therefore 


M-\ 


*T  A  *T  A  *T  r  *T  V  1  9 

^  A  Az  =  Z  =  % 


(3.12) 


«=0 


Combining  equations  3.10  and  3.12  we  get 
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(3.13) 


M-\  M-\ 

n=0  n-0 

2,  The  Discrete  Cosine  Transform 

Among  the  several  transforms  that  have  been  used  in  digital  watermarking  we 
will  introduce  the  Discrete  Cosine  Transform,  which  is  the  basis  of  our  technique.  One 
can  find  sufficient  details  in  several  references  ([27],  [28],  [29]).  We  will  try  to 
encapsulate  the  necessary  information  here  in  order  to  make  it  easier  for  the  reader  to 
follow  the  development  of  our  research.  We  start  from  the  one-dimensional  case. 
a.  One-dimensional  DCT 

Given  an  array  V  of  M  numbers  V  =  [vq,Vj,...,v^_j],  let  us  define  the 
sequence  f'  =  [Vo,Vj,...,v^_j,v^_j,...,Vj,VQ]  where  V'  can  be  written  as 


V'[k]  = 


V[k],  0<k<M-\ 

V[2M-\-kl  M<k<2M-\ 


(3.14) 


We  take  the  2M-point  DFT  of  V' 


2M-\ 


M-1 


2M-1 


T[u]  =  Yv'Vky-^^"^^  =YV[k'\e-^*'‘l^  +  [2M  - 1  -  ,f)<u<2M-\  (3.15) 


k=Q 


k=M 


Now,  if  we  substitute  /  =  2M  -\-k  in  equation  3.15  we  get 


which  yields 


M-l  M-1 

T[u]  = 

k=0  1=0 


(3.16) 


M-1 

T[u]  =  e^"/^^J]V[k](e 

k=0 


-j7r(2k+\)uf2M  ^j7i{2k-\-\)uj2M 


),0<u<2M-\  (3.17) 


and  therefore 


T[u]  = 


M-1 

Xn^]-2 


k=Q 


/ 

cos 

V 


jn{2k  +  \)u  ^ 
2M  y 


(3.18) 
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As  a  consequence  of  this  result,  we  define  a  new  transform 


DCT{V[k])  =  C[u]  =  < 


ir[0],  u  =  0 


(3.19) 


From  the  preceding  analysis  we  realize  that  the  DCT  of  a  vector  V  is  derived  if  we  take 
its  mirror  image,  concatenate  the  two  sequences  to  obtain  a  2M-point  sequence,  and  then 
take  the  first  M  points  of  the  resulting  2M-point  DFT. 

The  DCT  pair  is  more  commonly  expressed  as 

M-l 

=  (3-20) 

VMS 


C[m]  = 


{lk  +  \)u7r'^ 
2M  y 


for  the  forward  transform,  and  for  the  inverse  it  can  be  shown  ([28])  that 


(3.21) 


vm  = 


— C[0]  + 

'm 


cos 

V 


{2k  +  \)u7r'^ 
2M  y 


(3.22) 


The  variable  in  the  argument  of  the  cosine  is  responsible  for  frequency 
adjustments,  and  the  factor  that  multiplies  the  cosine,  adjusts  the  amplitude  of  the 
function.  Clearly  the  IDCT  is  the  summation  of  cosines  of  different  frequencies  and  the 
DCT  coefficients  represent  the  amplitude  of  each  cosine  function. 

b.  Two-dimensional  DCT 

The  two-dimensional  DCT  is  defined  as  separable  transform: 


T[i,  y]  =  c{i,  j)  ^  A  cos| 

x=0 


{2y  +  \)i7r']  (i2x  +  \)j7r 


2M 


cos 


2M 


(3.23) 


where 
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The  IDCT  for  the  two-dimensional  case  is 


=  0 

otherwise. 


ny,x]  =  ^'^c(i,J)T[i,j]cos 


1=0  j=0 


(2y  +  l)/;r 
2M 


(2x  + 1) 
2M 


(3.24) 


Unlike  the  DFT,  the  DCT  is  real  and  it  is  well  known  that  compared  to  the  DFT  it  is 
substantially  better  in  energy  compaction  for  most  correlated  sources.  With  the  DCT  we 
avoid  the  large  coefficients  for  the  high  frequency  components  that  are  produced  in  the 
DFT  due  to  the  discontinuities  at  the  boundaries. 

B,  THE  JOINT  PHOTOGRAPHIC  EXPERTS  GROUP  (JPEG)  STANDARD 

The  JPEG  standard  is  one  of  the  most  widely  used  standards  for  lossy  image 
compression  and  it  offers  a  very  good  data  compression  rate.  The  standard  proposed  by 
the  Joint  Photographic  Expert  Group  is  based  on  the  two-dimensional  DCT  and  its 
components  can  be  depicted  in  figure  3.  The  JPEG  standard  defines  three  lossy 
compression  modes,  namely,  the  baseline  sequential  mode,  the  progressive  mode,  and 
the  hierarchical  mode.  The  main  difference  between  these  modes  is  the  way  in  which  the 
DCT  coefficients  are  transmitted.  The  baseline  sequential  mode,  also  called  baseline 
mode  for  short,  is  the  simplest  of  the  modes  and  is  required  to  be  present  in  any  case 
(even  if  other  modes  are  used  the  baseline  mode  provides  the  default  decoding  capability 
[30]). 
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Figure  3.  Block  diagram  of  the  JPEG  compression. 

1.  The  Transform 

The  transform  used  in  the  JPEG  standard  is  the  DCT  transform  described  earlier. 
As  a  first  step  the  value  2^“^  is  subtracted  from  each  pixel  value,  where  P  is  the  bit 
allocation  per  pixel.  Eor  the  case  of  8-bit  per  pixel  images,  the  pixel  values  range  from  0 
to  255  and  2^“'  =  128 .  This  means  that  after  the  subtraction  the  pixel  values  are  in  the 
range  [-128,  127].  This  level  shifting  reduces  the  DC  offset  of  the  transformation  (i.e.  the 
value  of  the  DC  coefficient)  but  has  no  other  effect  whatsoever  in  the  results.  The  JPEG 
standard  dictates  that  the  image  is  divided  into  non-overlapping  8x8  blocks  and  each 
block  is  then  DCT  transformed.  In  case  that  the  image's  rows  or  columns  are  not 
multiples  of  eight,  the  last  row  or  column  is  replicated  until  the  image  reaches  a  multiple 
of  eight  size.  Any  additional  rows  or  columns  are  discarded  after  decoding. 

A  more  convenient  method  of  expressing  the  DCT  is  in  the  form  of  matrix 
operations.  In  this  case  the  forward  DCT  transform  is 

T  =  LVL\  (3.25) 

and  the  inverse  DCT  is 

V  =  L^TL,  (3.26) 

where  L  is  given  by  equation  3.27. 
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L[i,n=< 


1 

M 

Vm 


cos 


(2j  +  l)i7i 
2M 


i  =  0,0<  j<M-\ 


0<  j<M-\ 


This  last  form  is  particularly  helpful  for  the  implementation  of  the  algorithm  in  eomputer 
programs  and  will  be  used  indeed  in  our  development. 

2.  Quantization 


The  next  step  after  the  DCT  transform  is  quantization.  In  any  ease  quantization  is 
a  lossy  proeess  and  introduees  distortion  to  the  signal.  It  is  obviously  in  our  interest  to 
maintain  the  distortion  to  a  minimum. 


The  distortion  introdueed  by  quantization  is  measured  by  a  distanee  metrie.  The 
most  widely  used  is  the  Mean  Square  Error  (MSE): 


1  V-lM-l 

D  =  £[(.r-i)^]  =  — -y)= ,  (3.28) 

MIN  1=0 

whieh  applies  to  eaeh  MyN  bloek.  If  we  are  interested  in  the  size  of  the  error  relative  to 
the  signal,  we  ean  define  the  Signal-to-Noise  ratio  (SNR),  as 


SNR(dB)  =  1 0  log  10  — ,  (3 .29) 

where  S  is  the  average  square  value  of  the  souree  output.  A  measure  of  the  error  relative 
to  the  peak  value  of  the  signal  is  the  Peak  Signal-to-Noise  ratio  (PSNR),  whieh  is 

defined  as 


PSNRidB)  =  mog,o^.  (3.30) 

The  distortion  introdueed  by  quantization  is  inversely  proportional  to  the  step  size,  whieh 
in  turn  depends  on  the  bits  alloeated  per  eoeffieient.  Sinee  the  amount  of  information 
eonveyed  by  eaeh  eoeffieient  varies,  it  is  reasonable  to  alloeate  different  number  of  bits 
to  eaeh  eoeffieient,  with  more  bits  to  be  alloeated  to  the  eoeffieients  that  earry  more 


(3.27) 
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information.  As  a  measure  of  the  amount  of  information  that  is  earried  by  the  DCT 
eoeffieients  we  ean  use  the  varianee  of  the  eoeffieients.  Thus  eoeffieients  with  larger 
varianee  are  assigned  more  bits  then  eoeffieients  with  smaller  varianee. 

The  JPEG  standard  uses  an  8x8  table  ealled  quantization  table.  The  same 
quantization  table  is  used  for  all  the  image  bloeks.  The  elements  of  the  quantization  table 
determine  the  step  size  that  is  used  for  the  quantization  of  eaeh  eoeflioient  in  an  8x8 
bloek.  The  JPEG  standard  allows  different  step  sizes  to  be  ehosen  for  different 
eoeffieients.  This  implies  that  different  amount  of  distortion  is  introdueed  for  different 
frequeneies.  In  general  the  higher  frequeney  eoeffieients  are  more  severely  distorted  with 
the  use  of  greater  step  size.  The  deeision  of  the  relative  size  of  the  step  size  is  based  on 
repeated  experiments  that  take  into  aeeount  the  human  psyeho-visual  system  and  the  way 
the  distortions  in  different  frequeneies  are  pereeived  by  the  human  eye.  In  general  errors 
in  the  higher  frequeney  eoeffieients  are  more  easily  deteetable  and  thus,  in  these 
frequeneies  we  use  larger  step  size. 

The  JPEG  standard  does  not  speeify  the  exaet  quantization  matriees  that  should  be 
used,  however,  it  suggests  two  quantization  matriees,  one  for  the  luminanee  eomponents, 
and  one  for  the  ehrominanee  eomponents  that  have  proven  to  provide  exeellent  results. 
One  ean  ereate  a  eustomized  quantization  matrix  that  better  suits  one's  needs.  Tables  1 
through  3  show  examples  of  quantization  matriees. 

Table  1.  The  JPEG  proposed  luminanee  Q-table. 


16 

11 

10 

16 

24 

40 

51 

61 

12 

12 

14 

19 

26 

58 

60 

55 

14 

13 

16 

24 

40 

57 

69 

56 

14 

17 

22 

29 

51 

87 

80 

62 

18 

22 

37 

56 

68 

109 

103 

77 

24 

35 

55 

64 

81 

104 

113 

92 

49 

64 

78 

87 

103 

121 

120 

101 

72 

92 

95 

98 

112 

100 

103 

99 
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Table  2.  The  JPEG  proposed  ehrominance  Q-table. 


17 

18 

24 

47 

99 

99 

99 

99 

18 

21 

26 

66 

99 

99 

99 

99 

24 

26 

56 

99 

99 

99 

99 

99 

47 

66 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

99 

If  C[i,j]  is  the  transform  of  an  8x8  luminance  image  block,  its  quantized 
counterpart  C^[i,j]  is  given  by 


c,[i,n  = 


c[i,n 


(3.31) 


where  Qi\i,j^  is  the  luminance  quantization  table,  and  is  the  rounding  division  to  the 
nearest  integer.  Then  is  processed  through  a  decoder  to  produce  the 

reconstructed  quantized  coefficients  Cg[i,7] 


=  (3.32) 

A  quality  factor  q  is  normally  used  ([30])  to  control  the  degree  of  quantization. 
This  factor  lies  in  the  range  [1,  100]  and  it  represents  the  quality,  expressed  in  percent,  of 
the  quantized  image  compared  to  the  original  one.  A  quantization  factor  c  is  then  given 
by 


V 


1<^<50 

50<q<99 


(3.33) 
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Table  3. 


The  luminance  Q-table  proposed  by  the  lAHS  Incorporation. 


8 

6 

5 

8 

12 

20 

26 

31 

6 

6 

7 

10 

13 

29 

30 

28 

7 

7 

8 

12 

20 

29 

35 

28 

7 

9 

11 

15 

26 

44 

40 

31 

9 

11 

19 

28 

34 

55 

52 

39 

12 

18 

28 

32 

41 

52 

57 

46 

25 

32 

39 

44 

52 

61 

60 

51 

36 

47 

48 

49 

56 

50 

52 

50 

The  standard  JPEG  quantization  tables  of  Table  3.1  and  3.2  are  used  directly  for 
^=50%.  The  same  tables  are  multiplied  by  c  to  give  the  different  quality  (compression) 
levels.  For  100%  quality,  g=100,  that  is  lossless  compression,  and  all  the  elements  of 
c-Ql  are  set  to  1.  A  quantization  example  is  given  in  figure  4. 


Figure  4.  fishingboat  original  (left)  and  quantized  with  quality  factor  5%  (right) 
(courtesy  of  the  Signal  and  Image  Processing  Institute  at  the  University  of 
Southern  California). 
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3.  Coding 

After  the  DCT  transform  and  the  quantization,  further  lossless  compression  is 
achieved  by  proper  encoding.  In  each  8x8  block  of  quantized  image  coefficients  the  DC 
component  is  the  top-left  coefficient,  while  the  highest  frequency  components  are 
towards  the  bottom-right.  In  the  general  case  after  DCT  the  low  frequency  coefficients 
(top-left  except  the  DC)  have  larger  values  as  opposed  to  the  low-frequency  coefficients 
(bottom-right)  that  have  smaller  values.  After  quantization  many  of  the  coefficients 
towards  the  higher  frequencies  become  zero.  In  order  to  group  as  many  quantized  zero- 
value  coefficients  together  to  produce  longest  runs  of  zero  values,  the  AC  coefficients  are 
encoded  using  a  zigzag  path  (figure  5).  According  to  the  JPEG  standard  the  DC  and  AC 
coefficients  are  encoded  separately. 

a.  DC  Encoding 

The  DC  coefficients  tend  to  vary  slightly  between  successive  blocks. 
Therefore,  only  the  difference,  DIFF,  between  the  current  and  the  previous  block  is 
encoded.  For  the  first  block,  the  previous  block  value  is  set  to  zero.  The  potential  value  of 
DIFF  varies  in  the  range  [-2040,  2040],  however,  in  most  cases  DIFF  takes  relatively 
small  values. 


DC 

7 

7^ 

7^ 

7^ 

7 

/ 

/ 

/ 

/ 
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/ 

/ 
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/ 

b 
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Figure  5. 


The  zigzag  path  on  an  8x8  bloek. 


The  number  of  bits  S,  that  is  required  to  represent  DIFF  is  1  to  11. 
Additionally  although  the  differenee  of  0  requires  1  bit  to  express,  it  is  represented  as  a 
speeial  ease  with  zero  bits.  Thus  5=0  to  1 1  and  can  be  broken  into  12  categories.  Table  4 
shows  the  value  of  S  for  the  different  DIFF  values.  Now  the  compacted  values  are 
encoded  with  the  use  of  Huffman  code  ([37]).  The  codeword  consists  of  three  parts;  the 
code  for  S  as  obtained  from  Table  5;  one  sign  bit,  1  for  positive  and  0  for  negative;  and 
the  5-1  least  significant  bits  of  the  DIFF  value.  If  the  DIFF  value  is  negative,  in  the  third 
part  of  the  codeword  we  use  instead  the  I's  complement  of  the  5-1  least  significant  bits  of 
the  DIFF  value.  In  the  special  case  of  5=0  the  codeword  consists  of  only  one  part,  the 
Huffman  code  for  5  as  obtained  from  Table  5. 


Table  4.  The  DIFF  categories. 


5 

Difference  values 

0 

0 

1 

-1,1 

2 

-3, -2, 2, 3 

3 

-7, -6, -5, -4, 4, 5, 6, 7 

4 

-15,. ..,-8,8,. ..,15 

5 

-31,. ..,-16,16,. ..31 

6 

-63,... ,-32,32,... ,63 

7 

-127,. ..,-64,64,. ..,127 

8 

-255,. ..,-128,128,. ..,255 

9 

-511, ...,-256,256,. ..,511 

10 

-1023,. ..,-512,512,. ..,1023 

11 

-2047,. ..,-1024,1024,. ..,2047 

b.  AC  Encoding 
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For  the  encoding  of  the  AC  coefficients  the  zigzag  path  described  earlier 
comes  into  use.  The  idea  is  that  the  quantization  produces  large  blocks  of  successive 
zeros  especially  towards  the  high  frequencies.  Here  we  use  a  combination  of  Huffman 
coding  and  Run-Length  coding.  As  we  follow  the  zigzag  path,  each  non-zero  coefficient 
is  described  by  a  composite  R/S  symbol;  i?  is  a  4-bit  element  specifying  the  number  of 
zeros  between  the  last  non-zero  and  this  coefficient;  and  S  is  the  number  of  bits  that  are 
required  to  express  the  non-zero  coefficient  as  in  Table  6. 

If  all  remaining  AC  coefficients  are  zero  the  End-of-Block  (FOB)  symbol 
is  set.  If  the  number  of  zeros  in  a  run  exceeds  16  then  the  zero  count  recommences. 
Usually  two  hexadecimal  symbols  are  used  to  represent  the  composite  R/S.  The  codeword 
is  again  completed  with  two  more  parts;  the  1-bit  sign,  and  the  5-1  last  significant  bits  of 
the  value.  These  are  used  in  the  same  manner  as  in  the  DC  case,  which  means  that  the 
third  part  is  substituted  by  its  I's  complement  in  the  case  of  a  minus  sign. 

Table  5.  The  Huffman  code  for  DIFF  values. 


DC  Luminance  DC  Chrominance 


5 

Length 

Codeword 

Length 

Codeword 

0 

2 

00 

2 

00 

1 

3 

010 

2 

01 

2 

3 

on 

2 

10 

3 

3 

100 

3 

no 

4 

3 

101 

4 

1110 

5 

3 

no 

5 

lino 

6 

4 

1110 

6 

111110 

7 

5 

lino 

7 

1111110 

8 

6 

111110 

8 

11111110 

9 

7 

1111110 

9 

111111110 

10 

8 

11111110 

10 

1111111110 
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11 


9 


111111110 


11 


11111111110 


Table  6.  The  AC  categories. 


5 

Coefficient  values 

1 

-1,1 

2 

■3,-2,2,3 

3 

-7,-6, 

,-5,-4,4,5,6,7 

4 

-15,. 

1 

00 

00 

5 

-31,.. 

.,-16,16,. ..31 

6 

-63,.. 

.,-32,32,... ,63 

7 

-127,.. 

.,-64,64,... ,127 

8 

-255,..., 

-128,128,. ..,255 

9 

-511,..., 

-256,256,. ..,511 

10 

-1023,..., 

-512,512,. ..,1023 
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IV.  A  NON-UNIFORM  WATERMARKING  ALGORITHM 


Through  the  course  of  our  research  we  tried  to  reproduce  some  basic 
watermarking  scheme  that  would  adequately  sustain  the  basic  attacks  of  cropping  and 
compression,  while  at  the  same  time  maintain  sufficient  transparency.  Towards  that  end 
we  formed  some  new  theoretical  concepts  for  the  development  of  a  new  algorithm.  Both 
the  concepts  and  the  algorithm  are  presented  in  this  chapter. 

A,  ANALYSIS  OF  THE  NEW  CONCEPTS 

1.  Center  of  Interest  Proximity  Factor 

We  first  processed  the  idea  of  rating  the  8x8  blocks  of  the  image  DCT 
coefficients.  The  motivation  for  this  approach  was  the  insufficient  performance  against 
cropping  that  was  evident  in  many  of  the  studied  schemes. 

We  assert  that  the  resistance  of  the  image  to  cropping  depends  heavily  on  the 
spatial  location  of  the  image  blocks  that  are  selected  for  embedding  the  watermark 
coefficients.  If  the  coefficients  are  embedded  on  portions  of  the  image  that  will  be  later 
cropped,  those  coefficients  will  be  permanently  lost.  It  is  important  to  note  that  by 
transforming  an  image  to  the  DCT  domain  in  blocks  of  8x8  pixels  (JPEG  standard),  the 
spatial  relation  of  each  of  the  blocks  of  DCT  coefficients  is  maintained. 

It  is  generally  correct  that  in  cases  of  commercially  used  images  there  is  a  Region 
of  Interest  (RI),  where  most  of  the  image  information  is  concentrated.  For  the  purpose  of 
our  analysis  for  each  given  image  we  determine  a  specific  point,  which  is  called  the 
Center  of  Interest  (Cl).  As  the  Cl  we  may  choose  either  the  center  of  the  image  {MI2,NI2 
for  an  M'xN  image),  or  any  other  point  of  the  image.  In  the  experiments  to  follow  as  the 
Cl  we  used  the  center  of  the  image.  Following  the  same  rationale  it  is  reasonable  to 
assume  that  anyone  who  would  try  to  crop  the  image  for  any  reason  (either  for  attacking 
our  watermarking  system  or  just  because  he  has  no  interest  in  the  whole  image),  he  would 
crop  some  portion  near  the  borders  of  the  image  maintaining  most  of  the  information  that 
is  carried  around  the  Center  of  Interest.  Similarly,  pie  type  cropping  (figure  6)  should 
probably  be  considered  impracticable  for  anyone  who  would  try  to  benefit  from  the 
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image.  Our  intention  is  to  develop  a  method  that  takes  into  aceount  the  significance  of  the 
region  of  interest  in  determining  the  8x8  blocks  where  the  watermark  will  be  embedded. 


Figure  6.  Peripheral  (left)  versus  pie  type  (right)  cropping  of  Lena  (courtesy  of 
the  Signal  and  Image  Processing  Institute  at  the  University  of  Southern 
California). 

For  each  8x8  block  of  the  cover  image  we  determine  the  Euclidean  distance 
r(m,n)  between  the  center  of  the  block  with  coordinates  {m,n),  and  the  Cl  (with 
coordinates  (M/2, M2)  if  the  center  of  interest  is  the  same  as  the  center  of  the  image).  This 
distance  is  then  normalized  over  the  diagonal  (i.e.  the  maximum  possible  distance  within 
the  image)  to  produce  a  normalized  value  rnorm,  where  rnorm  G  [0,1].  This  normalized 
distance  is  then  processed  through  a  transformer  with  characteristic  function/ 

/(rnorm)  =  -  —  •  tan  ^  (A:  •  (rnorm -—))  +  — ,  (4.1) 

Tt  3  2 

where  k  can  typically  vary  in  the  range  [10,25].  The  result  is  the  Center  of  Interest 
Proximity  Factor  (CIPF=/(rnorm)).  A  typical  distribution  of  the  CIPF  can  be  depicted  in 
figure  7. 
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Figure  7.  The  distribution  of  the  CIPF  over  the  8x8  blocks  of  a  256x256  image 
with  k=15.  The  x  and  y  axes  are  the  coordinates  of  the  image  blocks 
(32x32  blocks  in  an  256x256  image). 

2,  Complexity  Factor 

The  main  idea  is  to  embed  the  watermark  in  the  image  blocks  where  it  could  not 
be  detected  by  the  HVS.  In  the  literature  there  have  been  several  papers  addressing  this 
issue.  One  such  attempt  is  to  use  the  variance  of  the  image  blocks  in  the  space  domain  as 
a  measure  of  their  embedding  capacity  ([31]).  This  means  that  if  the  variance  of  a  block 
in  the  space  domain  is  higher,  we  can  embed  in  this  block  larger  watermark  coefficients, 
with  lower  probability  that  the  produced  distortion  will  be  detected  by  the  HVS.  We 
claim  that  this  is  not  quite  correct,  and  we  can  prove  the  validity  of  our  claims  with  a 
trivial  example.  In  figure  8,  both  blocks  (8x8)  have  the  same  number  of  black  and  white 
pixels.  In  spite  of  having  different  pixel  arrangement,  both  blocks  have  the  same  variance 
(0.2540)  in  the  space  domain.  We  can  obviously  tell  that  changing  any  one  pixel  on  the 
left  block  will  be  immediately  detected  by  the  human  eye,  whereas,  the  same  alteration 
on  the  right  block  (which  has  a  more  complicated  visual  pattern)  would  require  more 
thorough  observation  for  detection. 

Our  idea  is  to  weight  the  absolute  value  of  the  DCT  coefficients  of  an  image 
block  differently,  according  to  the  part  of  the  spectrum  that  they  describe,  and  then  add 
them  up  to  produce  a  Complexity  Factor  (CF)  for  each  block.  With  our  method  for  the 
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same  example  of  figure  8,  we  get  a  faetor  of  44.2044  for  the  left  bloek  against  a  faetor 
790.8275  for  that  on  the  right. 

In  our  method  we  have  exeluded  the  DC  eoeffieient  from  the  ealeulations.  The 
reason  is  that  the  DC  eoeffieient  represents  the  average  pixel  offset  rather  than  frequeney, 
and  therefore  should  be  ignored.  Thus,  we  ereate  the  veetor  weight={\,2,...,61>'\,  whieh  we 
use  to  weigh  the  absolute  value  of  the  DCT  eoeffieients  using  the  formula 


CF^  =  weight  • 


(4.2) 


where  D,  is  a  vector  (1x63)  containing  the  DCT  coefficients  of  the  block  of  the  image 
according  to  the  standard  zigzag  arrangement,  (•)  is  the  matrix  multiplication  operation, 
and  CFi  is  the  resulting  Complexity  Factor  for  that  block. 

B,  ENCODER 


The  encoder  is  described  in  principle  by  the  block  diagram  in  figure  9.  Both  the 
watermark  and  the  image  are  DCT  transformed  and  processed  through  the  embedding 
algorithm.  The  outcome  is  then  IDCT  transformed  and  normalized  to  compensate  for  any 
errors  that  exceed  the  allowed  limits  of  the  pixel  values 


Figure  8.  Two  binary  8x8  blocks  with  the  same  number  of  ones  and  zeros  but 
different  perceptual  characteristics. 
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1.  Priority  Coefficient 

In  Section  A  we  saw  how  the  CIPF  and  the  CF  are  calculated  for  each  image 
block.  Now,  we  introduce  a  new  coefficient,  the  Priority  Coefficient  {PC).  Each  image 
block  i,  is  associated  with  a  Priority  Coefficient  PCi.  The  PCi  is  defined  as  the  CIPFj, 
scaled  by  the  CF/. 

PC-  =  CIPF-  ■  CF- .  (4.3) 

The  image  blocks  are  sorted  according  to  descending  order  of  their  PCi,  to 
produce  the  sequence  of  blocks  B],  B2,  ...  ,  Bk,  {K  is  the  total  number  of  image  blocks). 
Blocks  that  come  first  in  the  sequence  are  less  likely  to  be  cut  off  after  cropping,  and 
have  larger  variance  in  the  lower  and  middle  frequency  coefficients  allowing  for  higher 
unnoticeable  distortion.  Thus,  they  are  capable  of  successfully  “hiding”  higher  watermark 
coefficients  or  in  other  words  they  have  larger  embedding  capacity. 

2,  Embedding  Algorithm 

In  each  block  of  the  image  DCT  coefficients  we  embed  a  certain  number  of 
watermark  DCT  coefficients.  In  order  to  preserve  transparency,  we  embed  a  small 
number  of  watermark  coefficients  in  each  8x8  image  block.  We  refer  to  the  number  of 
watermark  coefficients  that  are  embedded  in  one  image  block  as  the  embedding  size  (es). 
A  typical  embedding  size  is  2,  4,  or  8  watermark  coefficients  per  image  block.  This 
means  that  the  watermark  size  cannot  be  larger  than  m-K,  where  m  is  the  embedding  size 
(me  [2,4,8]),  and  K  is  the  total  number  of  8x8  blocks  in  an  image. 

We  tried  to  produce  a  scheme  that  embeds  the  watermark  coefficients  into  the 
image  blocks  in  the  most  efficient  way.  The  rationale  can  be  described  by  the  following 
rules: 


•  The  watermark  coefficients  with  higher  magnitude  should  be  embedded  in  the 
higher-rated  image  blocks. 

This  serves  two  purposes:  higher  magnitude  watermark  coefficients  are  in  the 
general  case  the  most  important  ones  and  as  such  they  need  higher  protection 
against  cropping;  additionally,  they  cause  greater  distortion  to  an  image  block 
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after  embedding,  and  as  already  explained  the  higher  rated  image  bloeks  are 
less  suseeptible  to  distortion. 

•  Not  all  the  higher  magnitude  watermark  coefficients  should  be  embedded  in 
one  image  block. 

Otherwise  the  distortion  in  that  bloek  will  be  severe  and  will  not  be  tolerated 
by  the  HVS. 

Based  on  these  rules  we  created  the  algorithm  illustrated  in  figure  10.  The  DCT 
coefficients  of  the  whole  watermark  are  sorted  according  to  descending  order  of 
magnitude  \ci,  C2,  ....  cl\,  where  L  is  the  total  number  of  watermark  coefficients.  They  are 
then  divided  into  m  groups  with  equal  number  of  coefficients  [ci,  C2,  ...  ,C(L/m)  \  C(L/m)+i,  ••• 

.  C(2L/m)  I  •••  I  C((m-i)L/m)+h  •••  ,cl\-  The  Coefficients  are  now  regrouped  to  form  the 
embedding  sets.  Each  set  contains  m  coefficients,  one  from  each  group.  The  first 
coefficient  from  the  first,  second,  ...  ,  m'^  group  form  the  first  embedding  set  esl=[ci, 
C(L/m)+i,  •••  ,  C((m-i)L/m)+i\,  thc  sccond  Coefficient  from  the  first,  second,  ...  ,  m^'^  group  form 
the  second  embedding  set  es2=[c2,  C(L/m)+2,  •••  ,  C((m-i)L/m)+2^,  and  so  on,  until  es(L/m)= 
[c(L/m),  C(2L/m),  •••  .  Cz,]-  Thc  rcsult  is  {L/m)  sets  sorted  according  to  descending  order  of 
embedding  weight  from  esl  to  es(L/m).  The  sets  that  come  first  in  the  list  require  image 
blocks  with  larger  capacity.  Therefore  we  embed  esl  to  Bl,  es2  to  B2  etc. 


Each  set  is  embedded  into  m  coefficients  of  the  corresponding  image  block 
following  the  formula; 


^i(xstart)  ^i{xstart)  ^  Si 

u'  =u.,  +«-C.- 

i{xstart  +  l)  i{xstart  +  l)  i2 


u'  =  M.,  +«-C. 

i{xstart  +  m-\)  i{xstart  +  m-\)  im 


(4.4) 


where  a  is  a  weighting  factor  that  typically  ranges  around  0.1,  Cy  is  the  coefficient  of 
the  embedding  set,  uy  is  the  /*  coefficient  on  the  zigzag  arrangement  of  the  block, 
and  u'ij  is  the  modified  image  coefficient  Uij  after  embedding.  Eor  example,  for  m=4,  cj 
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corresponds  to  C21.  The  index  xstart  denotes  the  value  of  j  in  Wy  where  the  embedding 
starts. 


Figure  9.  The  basie  eneoder. 

Normalization  may  be  used  as  the  last  step  that  takes  plaee  in  the  spaee  domain 
after  the  marked  image  eoeffieients  have  been  IDCT  transformed  and  have  produeed  the 
marked  image.  The  use  of  normalization  is  optional  and  the  eoneept  is  further  explained 
in  the  next  ehapter. 
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C.  DECODER  AND  DECISION  MAKING 

The  decoder  (figure  11)  works  in  reverse  order  and  requires  both  the  original 
image  and  the  watermark.  The  DCT  coefficients  of  the  test  image  are  subtracted  from  the 
DCT  coefficients  of  the  original.  At  this  stage  the  sorting  information  of  the  watermark 
coefficients  is  also  needed  to  correctly  reassemble  the  potentially  recovered  watermark. 
The  result  is  IDCT  transformed  to  produce  the  recovered  object,  Wr  (product  of  the 
decoding  process). 

The  decision  making  device  is  based  on  classical  detection  theory  ([32]).  The 
recovered  object  is  now  compared  to  the  original  watermark  by  calculation  of  the 
correlation  coefficient  p,  of  the  two: 
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(4.5) 


P  = 


ZZf^(i,jWr(i,J) 

I  j _ 

^zzwiijy?  -zzwriijy?  ’ 


V  '  7  '7 

where  W(i,j)  is  the  (ij)  pixel  of  the  original  watermark,  and  Wr(i,j)  is  the  (i,j)  pixel  of  the 
reeovered  object.  The  decoder  decides  whether  the  recovered  object  corresponds  to  an 
actual  watermark  or  not,  based  on  a  predetermined  threshold  T.  Higher  p  means  that  W 
and  Wr  are  highly  correlated  and  therefore  have  higher  similarity  to  each  other.  This  is 
interpreted  as  higher  confidence  that  the  processed  image  has  been  indeed  watermarked. 
In  the  case  where  W  and  Wr  are  independent,  p  is  normally  distributed  with  zero  mean 
([33]).  Therefore,  the  probability  of  p  exceeding  a  certain  threshold  can  be  directly 
obtained  from  the  normal  distribution.  The  threshold  can  be  accordingly  adjusted  to 
match  our  probability  of  detection,  Pd,  and  probability  of  false  alarm,  PpA,  requirements. 


Figure  1 1 .  The  decoder. 
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V.  IMPLEMENTATION  ISSUES 


In  this  chapter  we  deseribe  several  interesting  parts  of  our  researeh.  Not  all  of 
them  were  sueeessful,  and  not  all  of  the  sueeessful  ideas  of  this  ehapter  were  aetually 
ineorporated  in  the  model  presented  in  ehapter  four. 

A.  KEYING 

To  enhanee  the  seeurity  of  the  watermark  we  ean  use  a  unique  key.  The  key 
should  be  applied  direetly  on  the  watermark,  after  the  error  eorreetion  eode  if  any,  but 
before  any  other  proeessing.  In  the  ease  of  a  random  watermark  however,  the  use  of  the 
key  as  a  seeurity  feature  is  redundant  or  the  key  should  also  be  the  watermark  itself.  We 
ean  extend  spread  speetrum  teehniques  ([4])  to  watermarking  by  applying  a  key  with 
length  multiple  of  the  watermark. 

We  implemented  the  keying  feature  in  our  algorithm  using  the  following  proeess: 
We  started  from  an  MyN  grayseale  watermark,  W ,  with  pixel  values,  W\i,j~\ 
(1  <  /  <  M,  1  <  j  <  N),  integers  in  the  range  [0,  255].  Eaeh  pixel  is  translated  into  binary 
with  8  bits  per  pixel  {W^[i,j,k],\<i<M,\<j<N,\<k<S).  Essentially  we  now  have 
an  MxNxS  binary  matrix  lE^  .  An  equally  sized  binary  Key,  K[i,j,k] ,  is  also  produeed, 
and  the  two  are  XORed 

W,, [i,  j,  k]  =  W, [i,  j,  k]  ©  K[i,  j,k],  1  <  /  <  M,  1  <  7  <  A,  1  <  k  <  8 .  (5.1) 

The  binary  is  translated  baek  into  8-bit  integer  representation  to  produee  the 
keyed  watermark  IT^  .  The  eoneept  is  implemented  in  our  algorithm  with  two  funetions; 

keying  and  bitPlanes  (appendix  A).  The  former  produees  the  Key  and  performs  the  aetual 
XOR  operation  while  the  latter  deeomposes  the  watermark  into  8  binary  planes  (figure 
12)  with  the  plane  at  the  baek  eontaining  the  Eeast  Signifieant  Bits  (ESB)  of  eaeh  pixel 
value  that  has  been  translated  into  binary. 
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Figure  12.  The  bitPlanes  function  concept. 

B,  QUANTIZATION 

We  know  from  Chapter  III  that  in  JPEG  compression  the  coding  part  is  lossless 
and  the  errors  that  occur  during  the  process  are  introduced  by  the  quantization  element 
(quantization  noise).  In  order  to  test  the  performance  of  the  proposed  algorithm  against 
JPEG  compression  it  was  therefore  sufficient  to  reproduce  the  quantization  component  of 
the  compression  algorithm. 

Since  all  our  experiments  were  conducted  with  grayscale  images  we  used  the 
standard  JPEG  luminance  quantization  table.  In  essence  equations  3.31  and  3.32  were 
modified  to 


c,[i,n  = 


c[i,n 


(5.2) 


=  (5.3) 

where  c  is  given  by  equation  3.33  and  is  the  standard  JPEG  luminance  quantization 
table  of  Table  3.1. 
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C.  NORMALIZATION 

In  the  case  where  we  have  to  work  with  parts  of  an  image  that  are  close  to  the 
limits  of  the  pixels’  dynamic  range  [0,  255],  embedding  the  watermark  coefficients  and 
returning  back  to  the  space  domain  may  produce  results  that  exceed  this  dynamic  range 
(figure  13  left).  The  most  straightforward  solution  is  to  truncate  all  the  off-range  values  to 
either  0  or  255.  This  approach  may  produce  acceptable  results  in  terms  of  transparency, 
but  it  introduces  additional  irreversible  errors  to  the  decoder,  thus  reducing  the 
performance  of  the  system. 


Figure  13.  Lena  marked  (left)  and  marked  and  normalized  (right).  The  black  and 
white  dots  that  can  be  seen  in  the  left  image  are  considerably  fewer  in  the 
normalized  image.  In  this  case  stripes  was  used  and  the  watermark 
coefficients  were  randomly  distributed  throughout  the  image. 


In  cases  where  we  are  allowed  to  increase  the  bit  allocation,  we  could  use  an 
invertible  normalizer  (figure  13).  We  devise  a  normalization  function  that  maintains  the 
dynamic  range  of  the  pixel  values  within  the  allowable  limits.  Our  choice  for  the  forward 
and  the  reverse  function  was 


1  -w  /  1 

y  =  --tan  (n-(x--))  +  -, 
7T  L  L 


(5.2) 
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(5.3) 


^  tan(n- ■  (y  -  h) 

:-  + - — 

2  n 


The  normalization  parameter  n  is  used  to  adjust  the  steepness  of  the  curve.  The  value  of 
3.5  was  experimentally  proven  to  work  better  since  it  provides  good  pixel  transformation 
(no  shift)  in  the  mid  ranges.  The  characteristic  curve  of  equation  5.2  for  n=3.5  is  shown 
in  figure  14.  As  shown  in  the  figure,  the  curve  is  limited  within  the  range  [0,  1].  This  can 
be  adjusted  to  [0,  255]  by  multiplication  of  equation  5.2  by  255.  Appropriate 
modifications  should  also  be  made  to  the  inverse  function  (equation  14). 


Non-linear  normalization  using  n(x)=1/2-t-1/pi(atan(n(x-1/2))),  n=3.5 


Figure  14.  The  normalization  function  for  n=3.5. 


The  possible  price  to  pay  for  the  use  of  normalization  is  that  the  parts  of  the  image 
with  pixel  values  near  the  limits  of  the  dynamic  range  are  altered  and  their  pixel  values 
are  shifted  towards  the  center  of  the  range.  There  are  cases  where  this  shift  is  perceptible 
to  the  HVS  and  therefore  the  success  of  the  method  depends  heavily  on  the  histogram  of 
the  image. 

Any  invertible  function  may  be  used  instead,  provided  that  it  is  limited  within  the 
allowable  range.  The  effectiveness  of  the  function  is  measured  mainly  by  the  quality  of 
the  normalized  image  or  in  other  words  by  whether  any  changes  are  perceptible  to  the 
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HVS.  We  would  like  a  good  normalization  function  to  be  linear  with  a  slope  of  1  (0=45°) 
for  the  most  part  and  non-linear  only  at  the  boundaries. 

In  figures  15  and  16  we  show  the  effect  of  the  normalization  on  two  images, 
namely  pentagon  and  arctic  hare,  with  intensity  histogram  shown.  In  figure  15  we  see 
that  the  histogram  of  pentagon  is  concentrated  towards  the  middle  of  the  [0,  255]  range. 
Our  function  in  this  case,  works  particularly  well  and  the  normalization  in  the  right  image 
is  virtually  undetected.  The  changes  in  the  histogram,  which  indeed  occur  as  we  observe 
comparing  the  two  plots,  are  imperceptible  by  the  HVS. 


Figure  15.  The  original  (left)  and  the  normalized  (right)  pentagon  and  their 
corresponding  histograms  (courtesy  of  the  Signal  and  Image  Processing 
Institute  at  the  University  of  Southern  California). 
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In  figure  16  we  clearly  see  the  effect  of  normalization  in  images  with  many  pixels 
towards  the  boundaries  of  the  allowable  range.  In  arctic  hare  the  white  shades  (pixel 
values  near  255)  are  dominating  the  image  and  therefore  the  histogram  displays  a  large 
concentration  of  pixels  towards  the  right  end.  The  characteristic  function  that  we  used, 
performs  poorly  in  this  extreme  situation  as  it  imposes  significant  changes  to  most  of  the 
pixel  values  after  processing  through  the  normalizer. 


Figure  16.  The  original  (left)  and  the  normalized  (right)  arctic  hare  and  their 
corresponding  histograms  (courtesy  of  R.  E.  Barber,  Barber  Nature 
Photography). 


With  the  average  image,  the  scheme  seems  to  perform  quite  well.  The  use  of 
normalization  is  optional  and  drives  our  attention  to  the  trade-off  between  transparency 
and  performance. 
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E, 


DECISION  MAKING  DEVICE 


For  the  decision  making  device,  as  explained  in  the  previous  chapter,  we  used  the 
correlation  coefficient,  p,  between  the  original  watermark,  W,  and  the  recovered  object, 
Wr.  where  p  was  derived  from  equation  4.5.  In  the  literature  there  have  been  several 
works  that  use,  instead,  a  different  formula  (equation  5.4)  for  the  derivation  of p\ 


XXW{i,j)Wr(i,j) 


References  [22],  [23],  [31],  [35]  use  equation  5.4  or  variations  of  it,  which  all 
have  in  common  that  there  is  no  participation  of  the  recovered  object  element,  ,  in  the 

normalizing  denominator. 

We  claim  that  equation  4.5  (also  used  in  references  [33]  and  [34])  is  a  more 
accurate  approach.  By  using  5.4  it  is  possible  that  we  obtain  values  oip  beyond  the  range 
[0,  1].  Equation  (5.4)  can  be  rewritten  as; 

XXW(i,j)Wr(i,j) 

P  =  — - (5.5) 

ZZ[»'(i,y)] 

'  j 

Let  us  consider  the  special  case  where  W  and  are  exactly  the  same  except  for  one 

pixel,  say  {k,X),  where  Wr(K,X)  >W(k,X)  .  Clearly,  in  this  case,  the  numerator  becomes 
greater  than  the  denominator  and  thus  p  exceeds  1 . 

This  means  that  setting  the  threshold  T  to  1  does  not  guarantee  that  only  perfectly 
retrieved  watermarks  will  pass  the  evaluation  test  of  the  decision  making  device.  In  other 
words  the  normalization  is  not  correct,  and  we  obtain  erroneous  impressions  leading  to 
incorrectly  setting  of  the  threshold  value. 

F.  ERROR  CORRECTION  CODING 

In  an  attempt  to  reduce  the  number  of  errors  that  our  scheme  suffered  after  attacks 

(quantization),  we  tried  to  implement  Error  Correction  Coding  in  our  algorithm.  The  7/15 
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BCH  code  capable  of  correcting  up  to  two  errors  ([36])  was  selected  because  it  was  more 
eonvenient  in  terms  of  overhead  and  eorreetion  eapability. 

We  numbered  the  watermark  pixels  p,  from  1  to  L  ([pi,p2,...,pL]),  starting  from 
the  top  left  corner  and  sweeping  the  watermark  row-wise  from  left  to  right  and  from  top 
to  bottom.  From  eaeh  pixel  we  eneoded  the  seven  most  signifioant  bits  (leaving  out  the 
LSB).  These  are  the  bits  that  eontribute  more  to  the  determination  of  the  pixel  value 
whieh  in  turn  affeets  the  result  of  the  eorrelation  of  the  reeovered  watermark  with  the 
original  watermark  at  the  deeision  making  deviee.  The  smaller  the  number  of  errors  in  the 
most  signifioant  bits,  the  higher  the  eorrelation  between  the  reeovered  and  the  original 
watermark.  Therefore  it  is  very  important  that  these  bits  remain  free  of  errors. 

For  every  watermark  pixel  that  is  eneoded  (aotually  its  seven  most  signifioant 
bits),  we  get  an  8-bit  (or  one  pixel)  overhead  from  the  oode.  In  essenoe  after  enooding  an 
MxN  (=L)  watermark  we  obtain  an  extra  MxNxS  bits  or  MxN  (L)  pixels  that  we  need  to 
aooommodate.  The  overhead  bits  form  pixels  that  fill  first  oolumns  and  then  rows  to  the 
right  and  bottom  of  the  image.  When  the  overhead  pixels  are  exhausted  and  the  expanded 
watermark  is  not  a  perfeot  reotangle,  we  zero  pad  the  remaining  bit  positions  in  order  to 
get  an  {M+e)x  (N+e)  watermark,  where  e  is  the  number  of  extra  rows  and  oolumns  that 
were  added  to  the  original. 

This  is  illustrated  in  figure  17.  There,  one  oan  see  the  [pi,p2,p3,-..,PK,PA,,P|iv,PL-2, 
Pl-i,Pl]  initial  pixels,  the  respeotive  overhead  bits,  and  how  they  shape  the  watermark. 

In  Appendix  A  we  show  the  results  of  the  implementation  of  the  ECC. 
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Figure  17. 


The  eoneept  of  expanding  the  watermark  after  BCH  coding,  where  the 
pixels  p,  of  the  watermark  are  numbered  from  1  to  L,  and  o  are  the 
overhead  bits. 
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VI.  RESULTS 


Part  of  the  results  presented  here  are  also  included  in  reference  [37]. 

A.  TESTED  IMAGES  AND  WATERMARKS 
1.  Images 

In  this  research  we  used  several  images  bearing  different  visual  and  spectral 
characteristics.  The  classic  images  (Lena,  figshingboat  etc)  were  primarily  used  along 
with  some  others.  There  was  however  interest  in  conducting  experiments  with  images 
that  had  some  particular  spectral  or  visual  characteristics,  which  could  not  be  found  in  the 
regular  images.  Therefore  we  produced  a  set  of  what  we  called  artificial  images  that  fitted 
our  needs. 

a.  Regular  (Non-synthetic)  Images 

We  used  six  images  that  are  shown  in  figure  18.  The  distribution  of  their 
pixel  values  is  shown  in  figure  19  where  we  can  see  their  different  characteristics.  Lena, 
peppers  and  fishing  boat  cover  a  broad  portion  of  the  allowed  range  and  have  various 
peaks.  Arctic  hare  and  fishing  boat  have  narrower  histograms  with  one  large  peak,  and 
New  York  exhibits  a  rather  uniform  distribution  of  its  pixel  values. 

b.  Artificial  (Synthetic)  Images 

The  four  artificial  images  (figure  20)  may  be  grouped  into  two  pairs. 
ImageB  and  imageSB  form  the  first  pair,  while  imageR  and  irnageU  form  the  second  one. 
Both  images  of  the  first  pair  contain  only  four  different  levels  of  grayscale.  In  imageB  we 
have  four  large  256x256  blocks.  On  the  other  hand  imageSB  is  divided  into  a  large 
number  of  similarly  arranged  smaller  blocks  of  4x4.  Thus  the  two  images  have  exactly 
the  same  histogram  but  different  visual  and  spectral  characteristics.  Similarly  in  the  case 
of  imageR  and  imageU  they  both  contain  the  whole  range  of  grayscale  shades  but  again 
they  have  different  visual  complexity. 
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Figure  18.  The  six  regular  images  that  were  used  in  the  researeh.l 


1  The  image  arctic  hare  is  eourtesy  of  Robert  E.  Barber,  Barber  Nature  Photography.  The  image  New 
York  is  eourtesy  of  Patriek  Loo,  University  of  Cambridge.  All  other  non-synthetie  images  used  in  this 
researeh  are  eourtesy  of  the  Signal  and  Image  Proeessing  Institute  at  the  University  of  Southern  California. 


2,  Watermarks 

In  the  course  of  our  research  we  first  had  to  decide  what  watermarks  we  should 
use.  In  the  literature  different  types  of  watermarks  have  been  proposed.  The  type  of 
watermark  depends  on  the  general  concept  that  we  have  adopted  for  our  watermarking 
scheme.  One  of  the  most  fundamental  questions  is  whether  the  watermarking  algorithm  is 
open,  or  its  details  are  kept  secret.  In  the  latter  case  any  pattern,  however  simple  and 
abstract,  may  be  used  as  a  watermark.  In  the  former  case  however,  a  key  is  required  to 
take  care  of  the  security  issues.  One  of  the  most  common  suggestions  among  researchers 
is  a  random  watermark,  which  is  also  a  security  component.  The  watermark  itself  is  a  key 
that  an  intruder  is  not  aware  of,  and  thus  cannot  verify  its  existence  or  removal. 
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There  may  be  eases  however,  where  a  pereeptual  watermark  may  be  preferable. 
Sueh  a  watermark  might  be  more  appealing  for  oommereial  purposes  as  it  eontains  a 
visually  recognizable  pattern,  such  as  a  copyrighted  logo.  This  implies  that  either  the 
algorithm  should  be  kept  secret  or  that  a  unique  key  should  be  used  for  protection. 


Figure  20.  The  four  artificial  images;  imageB  (top  left),  imageSB  (top  right), 
imageR  (bottom  left),  and  imageU  (bottom  right). 

a.  Watermark  Selection 

In  our  research  we  tried  to  test  several  different  cases  and  this  justifies  the 
selection  of  our  watermarks  (figure  21).  The  watermark  stripes  is  a  simple  visual  pattern 
and  is  used  as  a  perceptual  watermark.  NPSlogo  is  also  a  perceptual  watermark  that  also 
has  some  random  characteristics.  Finally,  randWm  is  a  watermark  whose  pixels  are 
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randomly  chosen  and  follow  a  uniform  distribution.  All  three  watermarks  are  encoded 
with  8-bit  grayscale  in  the  range  ([0,  255]),  with  dimensions  64x64.  We  deeided  to  seleet 
gray  scale  watermarks  although  this  imposes  higher  burden  to  the  marked  image  in  terms 
of  embedding  eapaeity  requirements.  Grayseale  allows  for  a  rnueh  larger  set  of  possible 
keys,  thus,  providing  better  seeurity  and  at  the  same  time  allows  for  more  eomplieated 
pereeptual  watermarks. 


Figure  21.  The  three  used  watermarks:  stripes  (left),  NPSlogo  (middle)  where 
everything  exeept  the  letters'  baekground  is  random,  and  a  randWm  (right) 
with  all  pixels  uniformly  distributed  in  the  range  [0,  255]. 


B,  TESTING  THE  NON-UNIFORM  ALGORITHM 

1.  Transparency 

Using  the  algorithm  of  Chapter  IV  with  no  normalization,  and  images  with  8-bit 
aeeuraey,  we  obtain  very  good  performanee  in  terms  of  the  transparency  of  the 
watermark.  We  ean  appreeiate  the  results  from  figures  22  and  23,  where  we  show  the 
marked  image  next  to  the  original  one  to  allow  comparisons.  All  these  examples  are 
produced  with  the  NPSlogo  watermark,  a=0.1,  xstart=A,  es=2.  If  the  transparency 
achieved  is  evaluated  as  insufficient,  one  ean  make  appropriate  adjustments  to  a  and 
xstart  to  satisfy  his  own  requirements. 


59 


Figure  22.  Original  and  marked  (NPSlogo)  Lena  (top)  and  peppers  (bottom),  with 
a=0.1,  xstort=4,  es=2.  All  images  are  of  type  uintS  (courtesy  of  the  Signal 
and  Image  Processing  Institute  at  the  University  of  Southern  California). 
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Figure  23.  Original  and  marked  {NPSlogo)  arctic  hare  (top)  and  New  York 
(bottom)  with  a=0.1,  xstart  =4,  es=2.  All  images  are  of  type  uintS.  {arctic 
hare  is  courtesy  of  R.  E.  Barber,  Barber  Nature  Photography,  New  York  is 
courtesy  of  P.  Loo,  University  of  Cambridge) 

2,  Watermark  Recovery  from  Marked  Image 

The  marked  image  is  of  the  MATLAB  type  "unsigned  integer  with  8  bits"  (uintS). 
Should  the  system  work  perfectly,  the  decoder  would  normally  produce  a  recovered 
watermark,  which  would  be  identical  to  the  original,  since  we  do  not  apply  any 
distortions.  Flowever  the  uintS  type  is  an  integer  number  in  the  range  [0  255]  and 
therefore  has  finite  accuracy  (8  bits).  After  embedding,  the  IDCT  produces  non-integer 
results  that  are  rounded  to  the  nearest  integer,  and  therefore  some  noticeable  distortion  is 
present  (figure  24).  This  means  that  the  correlation  coefficient  p,  is  in  most  cases  lower 
than  1  even  when  the  marked  image  has  not  been  tampered  with. 
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Figure  24.  Recovered  watermark  from  the  marked  arctic  hare  of  figure  23. 


3,  Performance  after  Quantization 

The  performance  of  the  algorithm  under  quantization  varies  with  the  frequency 
band  that  is  selected  for  the  embedding  of  the  watermark  coefficients.  Figures  25  -  28 
show  that  as  we  embed  in  higher  frequency  coefficients  (larger  xstart),  there  is  a  general 
tendency  that  the  performance  becomes  poorer  (smaller  p).  This  is  expected,  since  the 
higher  frequency  coefficients  are  subjected  to  severe  quantization  and  therefore  any  small 
amount  of  watermarking  information  contained  in  these  coefficients  is  essentially 
eliminated.  Flowever  we  also  see  that  the  type  of  the  watermark  also  matters.  A 
watermark  with  more  random  elements  {NPSlogo  as  opposed  to  stripes)  produces 
considerably  worse  results.  This  can  be  explained  as  follows;  A  simple  perceptual  pattern 
like  stripes  has  some  large  coefficients  in  the  lower  frequencies  and  most  of  the 
remaining  DCT  coefficients  are  zero.  The  high  frequency  coefficients  are  severely 
quantized  and  after  quantization  the  watermark  information  is  truncated  to  zero.  If  the 
watermarking  information  was  zero  anyway,  then  the  quantization  has  essentially  no 
effect  and  therefore  the  performance  is  almost  the  same  regardless  of  the  frequency  band 
of  the  embedding.  This  train  of  thought  is  also  verified  by  the  images  imageB  and 
imageSB  (figure  27).  In  each  8x8  block  the  former  contains  only  a  DC  offset  and  all  the 
rest  of  the  coefficients  are  zero.  The  latter  however,  has  non-zero  coefficients  up  to  the 
middle  frequencies.  Therefore  for  values  of  xstart  up  to  the  middle  frequencies  the 
performance  of  these  two  artificial  images  is  different.  From  the  point  where  the 

coefficients  become  zero  and  on,  their  performance  becomes  identical.  This  observation 
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applies  only  with  the  watermark  stripes  because  the  NPSlogo  contains  random  pixels  that 
vary  considerably  for  different  iterations. 

For  smaller  embedding  size  es,  we  expect  that  the  transparency  of  the  watermark 
increases,  since  the  distortion  applied  to  each  block  is  considerably  smaller.  Additionally, 
with  smaller  es,  the  results  reveal  a  tendency  for  slightly  higher  p  (figures  29,  30,  31). 
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Figures  25  -  28  show  that  the  embedding  setup  can  be  adjusted  accordingly  to 
produce  values  of  p  very  close  to  1 .  This  means  that  an  appropriate  threshold  T  can  be 
chosen  depending  on  our  Pfa  and  Pd  requirements  (see  section  IV.C). 

4,  Robustness  to  Cropping 

The  performance  of  our  algorithm  to  cropping  varies  with  the  images  marked,  and 
there  are  cases  where  it  can  be  really  exceptional.  The  results  are  shown  in  tables  7  and  8. 
As  expected,  the  performance  improves  as  the  size  of  cropping  increases.  However  there 
should  be  a  point  where  the  size  of  the  cropping  becomes  too  large  for  the  algorithm  to 
handle.  From  that  point  on,  too  many  essential  watermark  coefficients  embedded  towards 
the  Center  of  Interest,  are  cropped  out,  making  it  impossible  for  the  algorithm  to  perform 
sufficiently.  At  this  point  however  the  image  is  so  severely  cropped  that  becomes  useless 
for  any  potential  adversary  (figure  32). 

Table  7.  Performance  of  the  Non-uniform  Algorithm  against  Cropping  (NPSlogo,  a=0A, 
xstart=4,  es=2). 


Image 

Maintained  pixels 
after  cropping 

(initially  512x512) 

P 

without 

CIPF 

P 

with 

CIP 

F 

Improve 

ment 

Lena 

(50:460,50:460) 

0.5121 

0.85 

24 

66.45% 

New  York 

(50:460,50:460) 

0.0544 

0.26 

89 

394.3% 

fishing  boat 

(50:460,50:460) 

0.2612 

0.38 

55 

47.58% 

peppers 

(50:460,50:460) 

0.0671 

0.38 

10 

467.81% 
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Tables.  Performance  of  the  non-uniform  algorithm  against  cropping  {NPSlogo,  a=0.1, 
xstart=A,  es=2). 


Maintained  pixels 
after  cropping 

(initially  512x512) 

Lena 

fishing  boat 

p 

without 

CIPF 

P 

with 

CIP 

F 

Improve 

ment 

p  without 
CIPF 

p  with 
CIPF 

Improve 

ment 

(11:502,  11:502) 

0.5704 

0.91 

72 

60.79% 

0.5749 

0.8467 

Al.11% 

(31:482,31:482) 

0.4143 

0.75 

58 

82.42% 

0.2728 

0.5595 

105.09% 

(51:462,51:462) 

0.3038 

0.55 

75 

83.50% 

0.1663 

0.3522 

111.78% 

(71:442,71:442) 

0.2253 

0.36 

81 

63.38% 

0.1580 

0.2390 

51.32% 

(91:422,91:422) 

0.1906 

0.29 

06 

52.46% 

0.1081 

0.1540 

42.46% 

(111:402,  111:402) 

0.1384 

0.18 

38 

32.80% 

0.0921 

0.1223 

32.79% 

Figure  32. 


Cropped  fishing  boat  with  remaining  pixels  [111  ;402,  111  ;402]  from  a 
512x512  image  (courtesy  of  the  Signal  and  Image  Processing  Institute  at 
the  University  of  Southern  California). 
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c. 


SELECTION  OF  THE  WEIGHTING  FACTOR 


The  weighting  faetor  a,  used  in  the  embedding  algorithm  (seetion  IV.B)  also 
affeets  the  system's  performanee.  Figures  33  and  34  show  that  New  York  and  Lena 
behave  similarly,  both  in  the  ease  where  the  image  is  quantized  and  in  the  ease  where  it  is 
not. 

Examining  the  performanee,  p,  of  the  system  with  Yew  York  for  various  values  of 
a  when  the  marked  image  is  of  type  uintS,  we  notiee  that,  in  ease  of  no  attaek  to  the 
marked  image,  we  get  a  maximum  of  p  at  a=0.1,  with  the  values  varying  slightly  between 
0.9477  (for  a=0.6)  and  0.9934  (for  a=0.1).  If  quantization  is  applied,  it  is  reasonable  that 
the  performanee  would  be  different.  Up  to  a  eertain  value  of  a,  we  have  a  dramatie 
performanee  improvement.  As  a  still  inereases,  the  amount  of  improvement  is  redueed 
and  the  performanee  beeomes  essentially  unehanged.  The  improvement  stops  for  a 
around  0.3.  At  this  value  of  the  weighting  faetor  however,  it  is  possible  that  the  effeets  of 
the  embedding  in  the  marked  image  are  already  visible  (figure  35).  The  results  after 
quantization  ean  be  signilieantly  worse  if  we  ehoose  to  embed  in  higher  frequeney 
eomponents  (subjeeted  to  severe  quantization).  In  the  seheme  we  proposed  in  Chapter  IV 
we  use  a=0.I  whieh  appears  to  give  the  best  performanee  if  no  quantization  is  used. 


Figure  33.  Performanee  measured  on  the  marked  New  York  image  (uintS)  for 
various  values  of  a  (watermark:  NPSlogo,  xstart=4,  embedding  size=2). 
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Figure  34.  Performance  measured  on  the  marked  Lena  image  in  uintS  for  various 
values  of  a  (watermark;  NPSlogo,  xstart=4,  embedding  size=2). 


Marked  Image 


Figure  35.  Lena  marked  with  NPSlogo  and  xstart=4,  es=2.  The  distortion  at  a=0.3 
is  clearly  visible  (courtesy  of  the  Signal  and  Image  Processing  Institute  at 
the  University  of  Southern  California). 
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VII.  CONCLUSION 


A,  SUMMARY 

In  this  thesis  we  studied  the  use  of  the  DCT  in  digital  watermarking.  After  a 
historie  overview  and  a  brief  presentation  of  the  relevant  work  that  has  been  condueted 
by  other  researchers  (Chapter  II),  we  went  through  a  brief  analysis  of  the  theory  behind 
the  DCT  and  the  JPEG  compression  (Chapter  III),  so  that  the  reader  would  obtain  all  the 
necessary  background  for  the  comprehension  of  our  work. 

In  Chapter  IV  we  presented  a  new  non-uniform  watermarking  algorithm  that  is 
based  on  the  Discrete  Cosine  Transform.  The  algorithm  embeds  normally  in  the  lower 
frequency  components  (this  case  proved  to  give  better  results),  and  achieves  sufficient 
transparency  of  the  watermark,  but  also  robustness  against  quantization  (i.e.  JPEG 
compression)  and  cropping  (Chapter  VI).  Einally,  some  other  parts  of  our  research  worth 
mentioning  are  discussed  in  Chapter  V. 

B,  SIGNIFICANT  REMARKS 

The  performance  of  Discrete  Cosine  Transform  -  based  digital  watermarking  still 
needs  further  investigation.  The  development  of  the  JPEG  2000  standard  and  how  this 
will  affect  the  domination  of  JPEG  should  be  taken  into  account  for  future  research  in  the 
watermarking  area. 

There  are  qualities  of  the  grayscale  watermark  that  affect  the  recovery  process.  A 
simple  perceptual  {stripes)  watermark  behaves  differently  from  a  highly  random  one 
(NPSlogo).  The  randomness  imposes  an  additional  obstacle  to  the  decoder  and  reduces 
the  performance  of  the  decision  making  device  especially  in  the  higher  frequencies. 

In  the  proposed  scheme  we  used  a  unique  metric  for  measuring  the  relative 
capacity  of  each  image  block  to  receive  watermark  information  without  perceptual 
distortion  of  the  overall  image.  In  addition  another  metric  is  used  (CIPE)  to  defeat 
cropping  attacks.  The  combination  of  the  two  metrics  is  used  to  prioritize  the  image 
blocks  and  determine  the  watermark  coefficients  that  will  be  embedded  in  each  one  of 
them.  The  achieved  transparency  appears  to  be  sufficient  but  the  evaluation  is  rather 
subjective,  based  only  on  observation. 
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The  scheme  responds  quite  well  to  quantization  allowing  for  the  determination  of 
a  threshold  T  according  to  our  Pfa  and  Pd  requirements.  As  we  embed  towards  higher 
frequency  coefficients  the  performance  becomes  poorer  because  of  the  severe 
quantization  in  the  higher  frequency  bands.  Decreasing  the  embedding  size  appears  to 
slightly  contribute  to  improvement  of  the  overall  performance.  Additionally  p  is  also 
affected  by  variations  of  the  weighting  factor  a.  The  experiments  show  that  there  is  a 
trade-off  because  as  we  increasing  a,  we  improve  p  but  also  the  transparency  is 
negatively  affected.  We  should  bear  in  mind  that  there  are  certain  limits  suggested  by 
data  where  increasing  a  is  meaningful.  Beyond  these  limits  the  watermark  correlation 
coefficient  is  not  further  improved  and  the  transparency  is  essentially  degraded. 

The  performance  of  the  proposed  scheme  against  cropping  attacks,  ranges  from 
mediocre  to  exceptionally  good  results,  depending  on  the  input  data.  Improvement  of  the 
correlation  coefficient  exceeding  100%  is  frequently  observed  depending  on  the  tested 
image  and  the  amount  of  cropping. 

C.  FUTURE  WORK 

Embedding  the  coefficients  in  a  way  that  will  take  full  advantage  of  the  human 
visual  system’s  characteristics  is  a  big  goal  of  the  watermarking  community.  Further 
research  in  this  subject  is  required  to  investigate  the  possibility  of  incorporating  our 
metric  to  the  Just  Noticeable  Difference  (JND)  models  that  have  been  proposed. 

In  addition,  in  this  work  the  transparency  of  a  watermarking  algorithm  is  judged 
by  the  subjective  decision  of  independent  observers.  The  possibility  of  developing  a 
formal  model  for  the  evaluation  of  the  transparency  may  be  investigated.  However  this 
task  is  not  trivial.  A  simple  correlation  test  between  the  original  and  the  marked  image 
would  not  work.  This  would  detect  any  differences  between  the  two  but  cannot  tell  if 
these  differences  occur  in  a  visually  perceptual  manner.  A  JND  model  could  be  used  as 
the  basis  for  the  evaluation  of  the  transparency  ([24]),  but  then  it  should  not  also  be  used 
in  the  embedding  model.  Otherwise  the  judgment  would  be  biased  and  therefore  unable 
to  give  dependable  results. 
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D,  EPILOGUE 


The  watermarking  community  is  still  far  from  presenting  a  dominating 
watermarking  scheme.  The  research  in  digital  watermarking  has  been  dictated  by  the 
developments  in  the  digital  world  and  the  need  for  a  dependable  copyright  protection 
scheme.  In  other  words,  the  watermarking  community  has  been  just  following  the 
advances  of  other  technologies.  Maybe  this  is  the  largest  drawback  that  keeps  the 
watermarkers  away  from  the  desired  goal.  The  extreme  pace  with  which  the  digital 
technologies  are  progressing,  does  not  allow  sufficient  time  for  the  research  in  the  various 
watermarking  areas  of  interest  to  mature  and  produce  results.  A  more  independent  path 
may  be  the  secret  for  the  success,  and  maybe,  in  the  future,  the  compression  algorithms  or 
storage  techniques  will  be  following  the  developments  of  the  watermarking  world. 
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APPENDIX  A.  RESULTS  OF  THE  ECC  IMPLEMENTATION 


The  attempt  to  incorporate  an  error  correction  code  in  the  scheme  failed  because 
of  the  insufficient  error  correction  capability  of  the  code  we  chose.  Investigating  the  bit 
error  rate  between  the  original  and  the  recovered  watermark  we  see  that  in  general  is 
beyond  the  2  bits  that  the  7/15  BCH  code  is  capable  of  correcting.  The  results  are  shown 
in  figures  36  -  39.  The  bit  error  rate,  BER,  calculates  the  number  of  errors  in  bits  per 
pixel  (figures  36,  37).  To  get  a  better  idea  about  the  distribution  of  errors  among  the 
pixels  we  also  used  a  modified  bit  error  rate  metric,  the  BERmod,  which  is  the  number  of 
errors  in  bits  per  pixel  in  error  (figures  38,  39).  This  is  determined  by  the  total  number  of 
bit  errors  and  divided  by  the  number  of  pixels  where  errors  are  detected.  For  all  the  tested 
images  the  results  were  very  similar  and  definitely  exceeded  3  bits  per  pixel  for  both 
metrics.  The  BERmod  plots  are  almost  the  same  as  the  BER  ones,  showing  that  the  errors 
were  evenly  distributed  among  the  watermark  pixels. 
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APPENDIX  B.  SOFTWARE 


%  loannis  Retsas 
%  August  12,  2001 

%  LAST  MODIFICATION:  February  4,  2002 

%  FILE  NAME:  Encoder 

%  DESCRIPTION:  This  is  the  main  encoding  file  of  our  watermarking  framework. 


clear  all 

delete  C : \matlabR12\work\* .mat 
disp  (  'Processing. .  .  ' ) 

%*************** *****************3EyuP  SECTION******************************* 

I  =  imageSelectionC;  %  selecting  an  image  for  processing  from  the  gallery 
save  C:\matlabR12\work\I  I 

[M,N]  =  size  (I);  %  M,N  are  the  image  dimensions 

if  ( (M/8) /fix (M/8)  ~=  1)  I  ( (N/8) /fix (N/8)  ~=  1) 

fprintf (1, 'The  dimensions  of  the  selected  image  are  not  multiples  of  8\nand 
errors  will  occur; \nTHE  PROGRAM  IS  TERMINATED\N ' ) ; 

return 

end 

fprint f ( 1 , ' The  watermark  size  is  set  by  default  to  64x64 ; \n ') ; 

%  Any  modification  of  the  size  should  consider  the  dimensions  of  the  image  and 
%  the  embedding  size. 

Mw  =  64; 

Nw  =  64; 

W  =  WmTypeC (Mw, Nw) ; 

save  C:\matlabR12\work\W  W 

% - WEIGHTING  FACTOR - 

alpha  =  input  (' Set  the  weighting  factor  alpha  (recommended  value  0.1) ;\n'); 

disp  (  'Processing. .  .  ' ) 

save  C:\matlabR12\work\alpha  alpha 

% - XSTART - 

start  =  input  (' Set  the  index  of  the  coefficient  (1  to  64)  where  theXnembedding 
would  start  in  each  block; \n'); 

while  (start  <=  0)  |  (start  >=  64)  |  (start/fix (start) ~=1) 
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fprintf (1, 'Your  choice  was  either  beyond  the  allowed  range  or  was  not  an 
integer; \n ' ) ; 

start  =  input ('Try  again: \n'); 

end 

disp ( 'Processing. . . ' ) 

save  C:\matlabR12\work\start  start 

% - EMBEDDING  SIZE - 

fprintf (1, 'Set  the  embedding  size  (number  of  watermark  coefficients  per 
block) ; \n ' ) ; 

length  =  input (' Choose  2,  4  or  8;\n'); 

while  (length  ~=  2) & (length  ~=  4) & (length  ~=  8) 

start  =  input ('Your  choice  should  be  2,  4  or  8;  Try  again :\n'); 

end 

disp ( 'Processing. . . ' ) 

save  C:\matlabR12\work\length  length 

% - CROPPING  SETUP - 

flagCrop  =  input  ('For  cropping  press  I;  otherwise  press  0;\n'); 
while  (flagCrop  ~=  0) & (flagCrop  ~=  1) 

flagCrop  =  input ('Your  choice  should  be  either  0  or  1;  Try  again :\n'); 

end 

disp ( 'Processing. . . ' ) 

save  C:\matlabR12\work\flagCrop  flagCrop 
if  flagCrop  ==  1 

leftB  =  input  ('Enter  the  column  that  will  be  the  new  LEFT  border  of  the 
Image ; \n ' ) ; 

disp ( 'Processing. . . ' ) 

rights  =  input  ('Enter  the  column  that  will  be  the  new  RIGHT  border  of  the 
Image ; \n ' ) ; 

disp ( 'Processing. . . ' ) 

upperB  =  input  ('Enter  the  row  that  will  be  the  new  UPPER  border  of  the 
Image ; \n ' ) ; 

disp ( 'Processing. . . ' ) 

lowerB  =  input  ('Enter  the  row  that  will  be  the  new  LOWER  border  of  the 
Image ; \n ' )  ; 

disp  (  'Processing. .  .  '  ) 

cropParam  =  [leftB  rights  upperB  lowerB]; 
save  C:\matlabR12\work\cropParam  cropParam 

end 

% - QUANTIZATION - 

flagQ  =  input (' For  quantization  press  1;  otherwise  press  0;\n'); 
while  (flagQ  ~=  0)& (flagQ  ~=  1) 

flagQ  =  input ('Your  choice  should  be  either  0  or  1;  Try  again: \n'); 
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end 


disp ( 'Processing. . . ' ) 

% — QUALITY  FACTOR — 
if  flagQ  ==  1 

q_jpeg  =  input  (' Set  the  quality  factor  q_jpeg  in  the  range  [ 1 , 100 ] ; \n ' ) ; 

while  (q_jpeg  <  1) | (q_jpeg  >  100) | (q_jpeg/f ix (q_jpeg) ~=1 ) 

fprintf ( ' Your  choice  was  either  beyond  the  allowed  range  or  was  not  an 
integer; \n ' ) ; 

q_jpeg  =  input  ('Try  again :\n'); 

end 

disp ( 'Processing. . . ' ) 

end 

save  C:\matlabR12\work\flagQ  flagQ 

% - MARKED  IMAGE  IN  UINT8 - 

flags  =  input (' For  marked  image  in  uintS  press  1;  otherwise  press  0;\n'); 
while  (flags  ~=  0)&(flagS  ~=  1) 

flags  =  input ('Your  choice  should  be  either  0  or  1;  Try  again: \n'); 

end 

disp ( 'Processing. . . ' ) 
save  C:\matlabR12\work\flag8  flagS 
% —  IF  MARKED  IMAGE  REAL  — >  NORMALIZATION 
if  flags  ==  0 

flagNorm  =  input ('For  normalization  press  1;  otherwise  press  0;\n'); 
while  (flagNorm  ~=  0)& (flagNorm  ~=  1) 

flagNorm  =  input ('Your  choice  should  be  either  0  or  1;  Try  again :\n'); 

end 

disp ( 'Processing. . . ' ) 

save  C:\matlabR12\work\flagNorm  flagNorm 

end 

^•>'r*7'r*'>'r*7'r*'>'r*7'r*'>'r**'*^***'*^*'*^*'*^***'*^*'*^*'*^***'*^*'*^*'*^***'*^*'*^*7'r*'>'r*':ir*'>'r*7'r*'>'r*'>'r*'>'r*7'r*^ 

%*************** *********PrOCESSING  section********************************** 

% - DCT  OF  THE  IMAGE - 

Id  =  double (I);  %  It  is  assumed  that  the  image  from  is  in  uintS  form  [0  255] 

T  =  dctmtx ( 8 ) ; 

dct I  =  blkproc(Id, [8  8],'Pl*x*P2',T,T'); 
save  C:\matlabR12\work\dctI  dcti 

% - DCT  OF  THE  WATERMARK - 

Wd  =  double (W) ; 

dctW  =  blkproc (Wd, [8  8],'Pl*x*P2',T,T'); 
save  C:\matlabR12\work\dctW  dctW 
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clear  Wd 


% - EMBEDDING - 

dcti  =  embeds (dcti, dctw, alpha,  start,  length)  ; 
clear  dctW 

% - IDCT  OF  MARKED  IMAGE  COEFFICIENTS - 

Im  =  blkproc (dct I , [ 8  8 ] , ' Pl*x*P2 ' , T ' , T) ;  %  scrambled  marked  image 
clear  dcti 

% - UINT8  -  SNR - 

if  flags  ==  1 

Im  =  uint8(Im);  %  Im  is  the  marked  image  in  uintS 
SNR8  =  SNR (Id, double (Im) ) ; 

fprint f ( 1 , ' SNR  of  uintS  image,  SNR8 (dB) =%I . 4f\n ' , SNR8) ; 

else 

% - NORMALIZATION  -  SNR - 

SNRr  =  SNR (Id, Im)  ; 

fprint f ( 1 ,' SNR  of  real  image,  SNRr (dB) =%1 . 4f\n ',  SNRr)  ; 

Im  =  Im/255;  %  reduce  Image  to  range  [0  I]  plus  some  distortion  caused  from 
the  embedding 

if  flagNorm  ==  1 

n  =  3.5;  %  selected  optimal  value 

save  C:\matlabR12\work\n  n 

Im  =  1 /pi*atan (n* ( Im-1 /2 ) ) +1 /2 ;  %  normalization 
SNRnorm  =  SNR (I, (255*Im) ) ; 

fprintf  (1,  'SNR  of  real,  normalized  image, 

SNRnorm (dB) =%1 . 4f \n ' , SNRnorm) ; 

end 

end 

save  C:\matlabR12\work\Im  Im 

% - QUANTIZATION - 

if  flagQ  ==  1 

if  flags  ==  0 

Im  =  255*double  (Im)  ;  %  we  multiply  by  255  to  return  to  the  correct 

scale 

end 

Imq  =  qFunc ( Im, q_jpeg) ; 

if  flags  ==  0 

SNRrmq  =  SNR ( Id, Imq) ; 

fprintf (1, 'SNR  of  real,  marked  and  quantized  image, 

SNRrmq (dB)=%1.4f\n',  SNRrmq )  ; 

Imq  =  Imq/255; 

else 
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Imq  =  uint8(Imq); 

SNRSmq  =  SNR ( Id, double ( Imq) ) ; 

fprintf (1, 'SNR  of  uint8,  marked  and  quantized 
SNRSmq (dB) =%1 . 4f \n ' , SNR8mq) ; 

end 

save  C:\matlabR12\work\Imq  Imq 

end 

clear  Id 

%*************** *********d];3pl;^Y  section******************************** 
figure  ( 1 ) 
imshow ( I ) 

title (' Original  Image') 

%  figure  (2 ) 

%  imagesc(W, [0  255]),  colormap (gray ) , 

%  title (' Watermark ' ) 

figure ( 3 ) 
imshow ( Im) 

title (' Marked  Image') 

%  figure ( 4  ) 

%  imshow  ( Im-double ( I ) /2  55 ) 

%  title (' Difference  between  Marked  &  Original') 
if  flagQ  ==  1 

imhist ( Imq, 64 ) 

end 

if  flagQ  ==  1 
figure 
imshow ( Imq) 

title (' Quantized  Marked  Image') 

end 


image. 
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^•>'r*7'r*'>'r*':ir*'>'r*'>'r*'>'r**'*^***'*^*'*^*'*^***'*^*'*^*'*^***'*^*'*^*'*^***'*^*'*^*'>'r*'>'r*':ir*'>'r*7'r*'>'r*^*'>'r*7'r*'>'r*'^ 

^•>'r*7'r*'>'r*7'r*'>'r*7'r*'>'r**'*^***'*^*'*^*'*^***'*^*'*^*'*^*'*^*'*^*'*^*'*^***'*^*'*^*7'r*'>'r*':ir*'>'r*7'r*'>'r*'>'r*'>'r*'^ 

%  loannis  Retsas 
%  Aug  12,  2001 

%  LAST  MODIFICATION:  February  12,  2002 
%  FILE  NAME:  Decoder 

%  DESCRIPTION:  This  file  recovers  the  Watermark  from  a  marked  Image 

%*************** ****************D^'j^  loading********************************* 

clear  all 


load  C:\matlabR12\work\indexGr 
load  C:\matlabR12\work\index 
load  C:\matlabR12\work\alpha 
load  C:\matlabR12\work\start 
load  C:\matlabR12\work\length 
load  C:\matlabR12\work\flagCrop 
load  C:\matlabR12\work\flagQ 
if  flagQ  ==  1 

select  =  input ('Press  0  to  process  the  marked  image;  press  1  to  process  the 
quantized,  marked  imageXn'); 

while  (select  ~=  0) & (select  ~=  1) 

select  =  input  ('Your  choice  should  be  either  0  or  1;  Try  again:\n'); 

end 

end 

disp ( 'Processing. .  .  ' ) 

load  C:\matlabR12\work\flag8 

if  flags  ==  0 

load  C:\matlabR12\work\flagNorm 

end 

load  C:\matlabR12\work\I 
load  C:\matlabR12\work\W 
%  load  C:\matlabR12\work\KeY 
[Mw,Nw]  =  size (W) ; 
if  flagQ  ==  1 

if  select  ==  0 

load  C:\matlabR12\work\Im 
Itest  =  Im; 
clear  Im 
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elseif  select  ==  1 

load  C:\matlabR12\work\Imq 
Itest  =  Imq; 
clear  Imq 

end 

else 

load  C:\matlabR12\work\Im 
Itest  =  Im; 
clear  Im 

end 

Itest  =  double ( Itest )  ; 

%***************  *****************qrOPPING*  *********************************** 
if  flagCrop  ==  1 

load  C:\matlabR12\work\cropParam 
if  flags  ==  0 

II  =  0 . 5*ones ( size ( Itest , I ), size  ( Itest , 2 )) ; 

else 

II  =  128*ones ( size ( Itest , I ), size ( Itest , 2 )) ; 

end 

1 1 (cropParam ( 3 )  : cropParam ( 4 ) , cropParam ( I )  : cropParam  (2 )  )  = .  .  . 

Itest ( cropParam ( 3 ) : cropParam ( 4 ) , cropParam ( 1 ) : cropParam (2) ) ; 
title_array  =  strcat (' Cropped  Marked  Image  (alpha=',  num2str (alpha) ,  ') ') 

if  flags  ==  0 

figure  (5),  imshow(Il),  title (title_arraY) 

else 

figure  (5),  imshow (uintS ( II ) ) ,  title (title_array) 

end 

Itest  =11; 
clear  II 

end 

[M,N]  =  size  ( Itest ) ;  %  final  dimensions  after  cropping 

^•>'r*7'r*'>'r*':ir*'>'r*7'r*'>'r**'*^***'*^*'*^*'*^***'*^*'*^*'*^***'*^*'*^*'*^***'*^*'*^*7'r*'>'r*^*'>'r*7'r*'>'r*':ir*'>'r*'>'r^ 

%***************  *****************PrOCESSING*  ********************************* 


DENORMAL I ZING- 


if  flags  ==  0 

if  flagNorm  ==  1 

load  C:\matlabR12\work\n 

Itest  =  1/2  +  tan (pi* ( Itest-1/2 ) ) /n; 

end 
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Itest  =  255*Itest;  %  bring  to  range  [0,255] 

end 

% - ORIGINAL  IMAGE  DCT - 

load  C:\matlabR12\work\dctI 

% - TEST  IMAGE  DCT - 

T  =  dctmtx ( 8 ) ; 

dot Itest  =  blkproc(Itest,[8  8],'Pl*x*P2',T,T'); 
clear  Itest 

% - RECOVERED  WATERMARK  DCT  (EXTRACTED) - 

dctWr  =  extract 8 (dot I , dot Itest , Mw, Nw, index, indexGr , alpha, start , length) ; 
clear  dcti 
clear  dctitest 

% - IDCT  ON  RECOVERED  WATERMARK  COEFFICIENTS - 

Wr  =  blkproc (dctWr , [ 8  8 ] , ' PI *x*P2 ' , T ' , T) ;  %recovered  watermark 
Wr  =  uint8 (round (Wr) ) ; 
clear  dctWr 

% - BER - 

ber  =  BER(W,Wr) ; 

fprint f ( 1 ,' BER (bits  per  pixel) =%1 . 4f\n ', ber) ; 
bermod  =  BERmod (W, Wr ) ; 

fprint f ( 1 ,' BERmod (bit s  per  pixel  with  error) =%1 . 4f\n ', bermod) ; 

% - rho - 

rho  =  corCoef (W, Wr ) ; 
fprintf(l, 'rho=%1.4f\n', rho) ; 

% - XCORR - 

%  R  =  xcorr2 (double (W) , double (Wr) ) ; 

%  max_R  =  max (max (R) ) ; 

%  R  =  R/max_R; 

%*************** *****************D];3pL^Y** *********************************** 

%  if  select  ==  0 

%  title_array  =  strcat('Wm  recovered  from  marked  Image  (alpha=', 

num2str (alpha) ,  ' ) ' ) ; 

%  elseif  select  ==  1 

%  title_array  =  strcat('Wm  recovered  from  quantized  Image  (alpha=', 

num2str (alpha)  ,  '  )  '  )  ; 

%  end 

figure  (11) 

imagesc (Wr , [ 0  255]),  colormap (gray ) ,  title (' Recovered 

Watermark' ) %title (title_array ) 
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figure  ( 12 ) 

title_array  =  strcat('Xcor  between  W,Wr  (a=',  num2str(a), 
mesh (R) ; 
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function  [I]  =  imageSelectionC 


%  loannis  Retsas 
%  August,  2001 

%  LAST  MODIFICATION:  February  4,  2002 

%  FUNCTION:  imageSelectionC 
%  INPUT:  - 

%  DESCRIPTION:  A  routine  that  allows  the  user  to  select  an  image  from  the 
%  gallery. 

%  RETURNS:  The  image  to  be  processed  in  grayscale  uintS  form. 

%  NOTE : 1 .  " f ishingboat "  and  "pentagon"  are  already  grayscale  (no  need  for 

rgb2gray ) 


( imageB, 


2.  All  the  artificial  images  are  saved  in  the  corresponding  files 


%  imageSB,  imageR,  imageU)  with  he  same  variable  name  "image". 

%  3.  The  images  must  have  dimensions  that  are  multiple  of  8  to  be 

processed 


accordingly 


by  the  framework;  therefore  "arctichare"  and  "newyork"  are 


modified. 


disp(' Select  image  from  gallery;  Use...') 

fprintf(l,'l  for  Lena; \n2  for  peppers; \n3  for  fishing  boat;\n'); 

fprintf(l,'4  for  arctic  hare;\n5  for  New  York; \n6  for  pentagon; \n ') ; 

fprintf(l, '7  for  imageB  (blocks) ;\n8  for  imageSB  (small  blocks) ;\n9  for  imageR 
(random) ; \n  '  ) ; 

file  =  input  ('10  for  imageU  (unif orm) ; \n ' ) ; 
disp ( 'Processing  Data. . . ' ) 

if  file  ==  1 

I  =  imread ( ' lena7 ' , ' bmp ' ) ; 

I  =  rgb2gray ( T )  ; 
elseif  file  ==  2 

I  =  imread (' peppers ',' bmp ')  ; 

I  =  rgb2gray ( I )  ; 
elseif  file  ==  3 

I  =  imread (' f ishingboat ',' bmp ')  ; 
elseif  file  ==  4 

I  =  imread (' arctichare ',' bmp ')  ; 

I  =  rgb2gray ( I )  ; 
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I  =  I(:,l:592); 
elseif  file  ==  5 

I  =  imread ( ' newyork ' , ' bmp ' ) ; 

I  =  rgblgray ( I )  ; 

I  =  1(1: 512, 1 : 512) ; 
elseif  file  ==  6 

1  =  imread ( ' pentagon ' , ' bmp ' ) ; 
elseif  file  ==  7 

load  C : \NlNl\Thesis\ Images \imageB 
1  =  image; 
elseif  file  ==  8 

load  C : \NlNl\Thesis\ Images \imageSB 
1  =  image; 
elseif  file  ==  9 

load  C : \NlNl\Thesis\lmages\imageR 
1  =  image; 
elseif  file  ==  10 

load  C : \NlNl\Thesis\lmages\imageB 
1  =  image; 

end 
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function  [Ws,  Key]  =  keyingC (W) 


%  loannis  Retsas 
%  Aug  31,  2001 

%  LAST  MODIFICATION:February  14,  2002 

%  FUNCTION:  keyingC 

%  INPUT:  A  matrix  W  of  class  uintS. 

%  DESCRIPTION:  . . . 

%  RETURNS:  The  keyed  matrix  Wk  (uint8) ,  and  the  Key  used. 

[Mw,Nw]  =  size (W) ; 

%  W  =  uint8  (round  (W)  )  ;  %  if  you  include  this  you  get  an  error  if  the  input  is 

uint8 

Key  =  uint 8 ( round ( rand (Mw, Nw, 8 ) ) ) ; 

Wp  =  bitPlanes (W, 8 ) ;  %  the  watermark  decomposed  into  planes 
for  k  =  1 : 8 

Wk  (  :  ,  :  ,  k)  =  xor  (Key  ( :  ,  :  ,  k)  ,  Wp  ( :  ,  :  ,  k)  )  ;  %  each  plane  is  now  coded  with  the 

Key 

end 

Wk  =  ibitPlanes (Wk) ;  %  the  keyed  watermark  is  reassembled 

Wk  =  uint 8 (Wk) ; 
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function  [W]  =  WmTypeC (Mw, Nw) 


%  loannis  Retsas 
%  Aug,  2001 

%  LAST  MODIFICATION:  February  4,  2002 

%  FUNCTION:  WmTypeC 

%  INPUT:  The  dimensions  Mw,Nw,  of  the  watermark. 

%  DESCRIPTION:  A  function  that  accepts  the  watermark's  dimensions,  takes  you 
%  through  a  watermark  selection  process,  and... 

%  RETURNS:  A  grayscale  watermark  W. 

%  NOTE:  If  the  NPS  logo  is  selected,  the  size  is  by  default  64x64  (not 
adjustable) 

%  and  the  input  values  are  ignored. 

fprintf (' Select  one  of  the  watermarks.  Use:\n') 

selection  =  input (' 1  stripes; \n2  for  NPS  logo;\n3  for  random  grayscale 
watermark\n ' )  ; 

while  (selection  ~=  1 )&( selection  ~=  2 )&( selection  ~=  3) 

selection  =  input ('Your  choise  should  be  one  of  1,2  or  3;  Try  again: \n'); 

end 

disp ( 'Processing  Data.  .  .  ' ) 
if  selection  ==  1 

W  =  stripes (Mw, Nw) ; 
elseif  selection  ==  2 
W  =  NPS; 

elseif  selection  ==  3 
W  =  randWm (Mw, Nw) ; 

end 
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function  [dctio]  =  embedS (dct I , dctw, alpha, start , length) 


%  loannis  Retsas 
%  Nov  15,  2001 
%  LAST  MODIFICATION: - 
%  FUNCTION:  embeds 

%  INPUT:  The  matrix  dcti  which  will  be  marked,  the  matrix  dctw  which  will 
%  mark  dctI,  the  weighing  factor  alpha,  the  coefficient  in  each  8x8  block 
%  where  the  embedding  starts,  the  number  of  coefficients  that  are  embedded 
%  in  each  8x8  block. 

%  DESCRIPTION:  For  each  8x8  block  we  calculate  the  CIPF  (depending  on  each 
%  distance  from  the  Cl) .  The  dcti  is  reshaped  to  8x8x(M*N/64)  (we  sweep  dcti 
%  left  to  right  and  top  to  bottom) .  For  each  block  we  calculate  its  OF  &  the  PC 
%  (=  E*RIPF) 

%  and  we  sort  the  blocks  according  to  descending  PCs.  The  watermark 
coefficients 

%  are  sorted  by  magnitude  and  divided  into  [length]  number  of  groups.  The 
%  elements  of  each  group  with  the  same  index  ([length]  in  total)  form  a  set  of 
%  embedding  coefficients.  Now  each  set  is  embedded  into  each  8x8  block  starting 
%  from  the  [start]th  coefficient.  Finally  we  reshape  the  marked  matrix  to  its 
%  original  dimensions. 

%  RETURNS:  A  matrix  dctio  with  the  marked  coefficients. 

%  CAUTION: It  is  required  that  [length]  divides  exactly  (Mw*Nw)  and  that 
(Mw*Nw/ length) 

%  is  equal  or  smaller  than  the  number  of  8x8  image  blocks. 

[M,N]  =  size (dcti); 

[Mw,Nw]  =  size (dctw) ; 

%  we  mark  each  8x8  block  with  its  Eucledean  distance  from  the  center 

%  r(x,y)  is  the  distance  of  the  center  of  block  (x,y)  from  the  center  of  the 
image 

for  m= 1 : 8  :  M 

for  n=l:8:N 

r(fix(m/8)+l,fix(n/8)+l)  =  ( ( (m+3 ) -M/2 ) ^2  +  ( (n+3 ) -N/2 ) ^2 ) ^  ( 1 /2 ) ; 

end 

end 

%  reshaping  the  matrix  from  MxN  to  8x8x(M*N/64) 
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%  with  this  technique  we  sweep  the  matrix  row-wise  (left  to  right  -  top  to 
bottom) 

k  =  1; 

for  i  =  1 : 8  :M 

B ( : , : , k : k+N/ 8-1 )  =  reshape (dct I ( i : i+7 , : ) , 8 , 8 , N/8 ) ;  %  B  is  8x8x4!!! 
k  =  k+N/8; 

end 

%  we  similarly  (row-wise)  reshape  the  matrix  r  with  the  distances 
r_line  =  reshape  (r size (r, 1 ) *size (r, 2 ), 1 ) ; 

%  we  calculate  for  each  block  the  CIPF  (Center  of  Interest  Proximity  Factor) 
rmax  =  max (max (r) ) ; 

CIPF  =  -l/pi*atan ( 15* (r /rmax-2/3 ) ) +1/2 ; 

weight  =  [  ]  ;  %  this  is  the  vector  that  will  accomodate  the  different  weight 

that  is  used  for  each  coefficient 

for  i  =  1:63 

weight  =  [weight  i]  ; 

end 

%  F(i)  is  the  CF  calculated  for  each  block  i 
for  i  =  l:size(B,3) 

B2  =  B(:, :,i) ; 

V  =  zigzag (abs  (B2 )) ; 

F(i)  =  weight *V ( 2 : 64 ) ' ; 

%  for  each  block  we  determine  a  Priority  Coefficient  PC  which  is  the  CF 
%  weighted  by  the  CIPF 
PC  (i)  =  F  (i)  *CIPF  (i)  ; 

end 

PC  =  PC/max (F);  %  normalization 

[ varB ( size  (B, 3 )  : -1 : 1 ) ,  index ( size (B, 3 ): -1 : 1 ) ]  =  sort (PC) ; 

B(:,:,:)  =  B (:,:, index) ;  %  B  contains  the  8x8  blocks  sorted  by  descending  order 
of  PC 

save  C:\matlabR12\work\index  index 

%  sorting  of  the  dct  coefficients  of  the  watermark  by  descending  order  of 
magitude 

[x,  index2 (Mw*Nw : -1 : 1 ) ]  =  sort (abs (dctw (:))) ; 

dctw  =  dctw ( index2 )  ;  %  this  way  we  avoid  changing  the  values  to  positive  after 

sorting  by  var 
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%  group  the  dot  coefficients  of  the  watermark  in  [length]  groups 
for  i  =  1 : length 

gr(:,i)  =  dctW ( ( i-1 ) *Mw*Nw/length+l : i*Mw*Nw/length)  '  ; 

indexGr(:,i)  =  index2 ( (i-1 ) *Mw*Nw/length+l : i*Mw*Nw/length) ' ; 

end 

save  C:\matlabR12\work\indexGr  indexGr 

%  embedding 

for  i  =  l:size(gr,l) 

V  =  zigzag  (B  (:,:,  i)  )  ;  %  V  is  a  row  vector  that  contains  the  elements  of  an 

8x8  block  aligned  in  zz  fashion. 

V  ( start : start  +  length-1 )  =  V ( start : start  +  length-1 )  +  alpha*gr(i,:); 

B(:,:,i)  =  zzRvs (V) ; 

end 

B(:,:,  index)  =  B(:,:,:);  %  desorting  the  8x8  dct  blocks  of  the  image  to  get 

their  original  order 

k  =  1; 

for  i  =  1 :  8  :M 

dct lo ( i : i  +  7 ,  : )  =  reshape (B  (  : ,  :  ,  k : k+N/ 8-1), 8, N);  %  contains  the  marked  dct 

coefs  of  the  image 

k  =  k+N/8; 

end 
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function  dctWr=extract 8 (dct I , dct Im, Mw, Nw, index, indexGr , alpha, start , length) 

%  loannis  Retsas 
%  Sep  19,  2001 

%  LAST  MODIFICATION:Nov  15,  2001 
%  FUNCTION:  extracts 

%  INPUT:  The  matrix  dcti,  the  marked  matrix  dctim,  the  watermark  dimensions  Mw, 

%  Nw,  the  vectors  index  and  indexGr  from  embedS,  alpha,  start  and  length. 

%  DESCRIPTION:  We  calculate  the  difference  between  the  two  input  matrices. 

%  The  dctDif  is  reshaped  to  8x8xXXXX.  The  start  to  start+length-1  coeffs  are 
selected  and 

%  are  put  back  together  to  make  up  the  watermark. 

%  RETURNS:  The  dct  coeffs  of  the  retrieved  watermark. 

^•>'r*'>'r*'>'r*':ir*'>'r*7'r*'>'r**'*^***'*^*'*^*'*^***'*^*'*^*****'*^*'*^*'*^***'*^*'*^*7'r*'>'r*':ir*^*7'r*'>'r*^*'>'r*7'r*'>^ 

[M,N]  =  size(dctl); 
dctDif  =  dctim  -  dctI; 
k  =  1; 

for  i  =  1:8:M  %  reshape 

B ( : , : , k : k+N/ 8-1 )  =  reshape (dctDif ( i : i+7 ,:), 8 , 8 , N/8 ) ;  %  B  is  8x8x4!!! 
k  =  k+N/8; 

end 

B  =  B ( : , : , index) ;  %  sorting  using  index  (:,:,:) 
embeddingSetsNumber  =  Mw*Nw/length; 
for  i  =  1  :  embeddingSetsNumber 

V  =  zigzag  (B  (:,:,  i ))  ;  %  V  is  a  row  vector  that  contains  the  elements  of  an 

8x8  block  aligned  in  zz  fashion. 

dctWr(i,:)  =  V(start:start+length-l)/alpha; 

B(:,:,i)  =  zzRvs (V) ; 

end 


%  k  =  1;  %  counts  the  number  of  blocks  that  are  being  embedded. 

%  q  =  1;  %  counts  the  4  coefficients  that  are  embedded  in  each  block. 

%  for  i  =  1  :  Mw*Nw 

%  V  =  zigzag  (B  (:,:,  k)  )  ;  %  V  is  a  row  vector  that  contains  an  8x8  block 

aligned  in  zz  fashion. 

%  dctWr(i)  =  V ( 6+q) /alpha;  %  embedding  on  the  7th, 8th, 9th  and  10th 

coefficients . 


q  =  q+1; 
if  q  ==  5 
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%  q  =  1; 

%  k  =  k+l; 

%  end 

%  end 

dctWr ( indexGr ( : ) )  =  dctWr ( : ) ;  %  desorting  the  dctW  coefficients  using  indexNew 
dctWr  =  reshape (dctWr, Mw,Nw); 


96 


function  [Iq]  =  qFunc ( I , q_ jpeg) 

%  loannis  Retsas 
%  February  11,  2002 
%  LAST  MODIFICATION: - 
%  FUNCTION:  qFunc 

%  INPUT:  An  image  in  the  range  [0  255],  and  a  quality  factor  [1  100] . 

%  DESCRIPTION:  Performs  8x8  block  quantization  on  I,  using  the  standard  JPEG 
%  luminance  quantization  table. 

%  RETURNS:  The  quantized  image  Iq  (double  [0  255]  -  may  need  to  be 
%  transformed  to  uintS  Sin  order  to  be  displayed) . 

^•>'r*':^r*7'r*^*'>'r*^*7'r**'*^***'*^***'*^*'*^'*:'*^*'*^*'*^***'*^*'*^*'*^*'*^*'*^***^*7'r*':ir*'>'r*':^r*7'r*^*7'r*7'r*7'r*':ir* 

I  =  double  ( I )  ; 

T  =  dctmtx ( 8 ) ; 

dct I  =  blkproc(I,[8  8],'Pl*x*P2',T,T'); 

Q  =  stdJPEGQ; 
if  q_jpeg  <=  50 

q  =  50/q_jpeg; 


Q  =  q*Q; 

elseif  (50  <  q_jpeg)  &  (q_jpeg  <=  99) 


q 

=  2-(2*q_jpeg) 

/I 

o 

o 

Q 

=  q*Q; 

else 

Q 

=  ones  (8)  ; 

end 

dct  I  = 

blkproc (dctI, 

[8 

8] 

1  ,  'x./Pl' 

,Q);  %  as  above 

(using  of  one  of  the  Q  tables) 

dct  I  = 

round (dcti ) ; 

dct  I  = 

blkproc (dcti. 

[8 

8] 

1  ,  'x.*Pl' 

,Q)  ; 

Iq  =  blkproc (dcti, [8 

8 

]  ,  ' 

'  Pl*x*P2 ' 

, T ' , T) ;  %  image 

after  quantization  process 
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function  [dcti]  =  quantFunc (dcti ) 


%  loannis  Retsas 
%  Aug,  2001 

%  LAST  MODIFICATION:  Oct  10,  2001 
%  FUNCTION:  quantFunc 

%  INPUT:  A  matrix  dcti  (of  DOT  coefficients)  . 

%  DESCRIPTION:  Performs  8x8  block  quantization  on  matrix  M,  using  one  of  the 
%  offered  quantization  matrices. 

%  RETURNS:  The  quantized  matrix  dcti  (the  quantized  DOT  coefficients) . 

fprintf (' Select  Quantization  table;  Press  ... \n ') ; 

QSelection  =  input (' 1  for  binary  table; \n2  for  default  JPEG  table; \n3  for  Image 
Alchemy,  Handmade  Software  Inc.  table; \n'); 

while  (QSelection  ~=  1 )& (QSelection  ~=  2 )& (QSelection  ~=  3) 

QSelection  =  input ('Your  choise  should  be  one  of  1,2  or  3;  Try  again: \n'); 

end 

if  QSelection  ==  1 

comprRatio  =  input('Enter  the  compression  ratio  (x/64)  for  the  DCT:\n'); 

Q  =  binaryQ  (comprRatio)  ;  %  we  assign  to  Q  the  values  of  a  Quantization 

table 

elseif  QSelection  ==  2 
Q  =  stdJPEGQ; 
elseif  QSelection  ==  3 
Q  =  lAHSIncQ; 

end 

if  (QSelection  ==  2)  |  (QSelection  ==  3) 

q_jpeg  =  input ('Type  the  value  of  the  compression  factor  q_jpeg\n (default 
value  ( % )  :  50 )  : \n ' ) ; 

while  (q_jpeg  <  1)  |  (q_jpeg  >  100) 

q_jpeg  =  input ('Your  choise  should  be  an  integer  in  the  range  [1,  100]; 

Try  again : \n ' ) ; 

end 

if  q_jpeg  <=  50 

q  =  50/q_jpeg; 

Q  =  q*Q; 

elseif  (50  <  q_jpeg)  &  (q_jpeg  <=  99) 
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q  =  2- (2*q_jpeg) /lOO; 

Q  =  q*Q; 

else 

Q  =  ones  (8) ; 

end 

end 

disp ( 'Processing  Data.  .  .  ' ) 
if  QSelection  ==  1 

dcti  =  blkproc (dcti ,  [ 8  8  ]  ,  ' PI . *x ' , Q) ;  %  dctl=dctlmq,  are  the  quantized 

coef s . 

else 

dcti  =  blkproc  (dcti ,[  8  8  ]  ,  '  x . /PI '  ,  Q)  ;  %  as  above  (using  of  one  of  the  Q 

tables ) 

dcti  =  round (dcti) ; 

dcti  =  blkproc (dct I , [8  8],'x.*Pl',Q); 

end 
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function  [q]  =  stdJPEGQ 


%  loannis  Retsas 
%  Aug,  2001 
%  FUNCTION:  StdJPEGQ 
%  INPUT:  - 
%  DESCRIPTION:  - 

%  RETURNS:  The  default  JPEG  quantization  table. 


[16 

11 

10 

16 

24 

40 

51 

61; 

12 

12 

14 

19 

26 

58 

60 

55; 

14 

13 

16 

24 

40 

57 

69 

56; 

14 

17 

22 

29 

51 

87 

80 

62; 

18 

22 

37 

56 

68 

109 

103 

77; 

24 

35 

55 

64 

81 

104 

113 

92; 

49 

64 

78 

87 

103 

121 

120 

101; 

72 

92 

95 

98 

112 

100 

103 

99] 
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function  [R]  =  BER (W,  Wr) 

%  loannis  Retsas 
%  Nov  14,  2001 
%  LAST  MODIFICATION: - 
%  FUNCTION:  BER 

%  INPUT:  Two  equally  sized  matrices  W,  Wr  (uintS) . 

%  DESCRIPTION:  Calculates  the  Bit  Error  Rate  (in  error  bits  per  pixel) . 

%  RETURNS:  The  Bit  Error  Rate,  R. 

[Mw,Nw]  =  size (W) ; 

[Mwr,Nwr]  =  size (Wr) ; 
if  (Mw  ~=  Mwr)  I  (Nw  ~=  Nwr) 

display ('The  two  iputs  do  not  have  the  same  size;  rho  will  not 
calculated ' ) 

return 

end 

Wb=bitPlanes (W, 8) ; 

Wrb=bitPlanes (Wr,  8)  ; 

sum=0 ; 

for  i=l : Mw 

for  j=l : Nw 

for  k=l:8 

if  Wb(i,j,k)  ~=  Wrb(i,j,k) 
sum  =  sum+1; 

end 

end 

end 

end 

R  =  sum/ (Mw*Nw) ; 


be 
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function  [R]  =  BERmod (W,  Wr) 


%  loannis  Retsas 
%  Nov  14,  2001 
%  LAST  MODIFICATION: - 
%  FUNCTION:  BERmod 

%  INPUT:  Two  equally  sized  matrices  W,  Wr  (uintS) . 

%  DESCRIPTION:  Calculates  the  Bit  Error  Rate  (in  error  bits  per  pixel  of 
error)  . 

%  RETURNS:  The  Bit  Error  Rate,  R. 

[Mw,Nw]  =  size (W) ; 

[Mwr,Nwr]  =  size (Wr) ; 
if  (Mw  ~=  Mwr)  I  (Nw  ~=  Nwr) 

display ('The  two  iputs  do  not  have  the  same  size;  rho  will  not  be 
calculated ' ) 

return 

end 

Wb=bitPlanes (W, 8) ; 

Wrb=bitPlanes (Wr,  8)  ; 

sum  =  0;  suml  =0;  %  counters 

for  i=l : Mw 

for  j=l : Nw 

flag  =  0;  %  the  flag  is  set  to  0  for  each  new  pixel 
for  k=l:8 

if  Wb(i,j,k)  ~=  Wrb(i,j,k) 

flag  =  1;  %  the  flag  is  set  to  1  when  an  error  occurs  in  a 

pixel 

sum  =  sum+1; 

end 

end 

if  flag  ==  1 

suml  =  suml+1; 

end 

end 

end 

R  =  sum/ suml; 
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function  [Ip]  =  bitPlanes ( I , type) 


%  loannis  Retsas 
%  Aug  17,  2001 

%  LAST  MODIFICATION:  Sep  5,  2001 
%  FUNCTION:  bitPlanes 

%  INPUT:  A  grayscale  Image,  the  data  type  (8  for  uint8,  16  for  uintl6...) 

%  DESCRIPTION:  Receives  a  grayscale  -uint8-  image  as  an  input.  For  each  pixel 
%  of  the  image  gets  the  binary  representation  of  its  value.  Creates  a  set  of 
%  binary  image  planes,  each  containing  one  bit  per  pixel.  Each  plane  contains 
%  bits  of  the  same  significance. 

%  RETURNS:  A  binary  matrix  where  the  first  two  dimensions  are  the  actual 
%  dimensions  of  the  input  image,  while  the  third  dimension  represents  the 
%  different  planes,  each  containing  equally  significant  bits  of  the  binary 
%  representation  of  the  value  of  each  pixel.  Plane  1  (k=l)  contains  the  most 

%  significant  bits. 

[M, N]  =  size  ( I ) ; 

%  I  =  uint 8 ( round ( I ) ) ;  %  PROSOXH 
for  i  =  1 : M 

for  j  =  1 : N 

for  k  =  type: -1:1 

Ip (i, j, type  +  l-k)  =  bitget  ( I  ( i, j ) , k) ; 

end 

end 

end 
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function  [rho]  =  corCoef (W,  Wr) 


%  loannis  Retsas 
%  Nov  14,  2001 
%  LAST  MODIFICATION: - 
%  FUNCTION:  corCoef 

%  INPUT:  Two  equally  sized  matrices  W,  Wr. 

%  DESCRIPTION:  After  substracting  the  mean,  calculates  the  cross  correlation 
%  of  the  two  matrices  for  the  instant  that  the  two  matrices  are  aligned. 

%  RETURNS:  rho  (the  cross  correlation  of  the  two  matrices  for  the  instant 
%  that  the  two  matrices  are  aligned) . 

[Mw,Nw]  =  size (W) ; 

[Mwr,Nwr]  =  size (Wr) ; 
if  (Mw  ~=  Mwr)  I  (Nw  ~=  Nwr) 

display ('The  two  inputs  do  not  have  the  same  size;  The  program 
terminated ' ) 

return 

end 

m_r  =  mean (mean (double (Wr) ) ) ; 
m  =  mean (mean (double (W) )) ; 

Wr  =  double (Wr) -m_r; 

W  =  double (W) -m; 

sum_x  =  0;  sum  =  0;  sum_r  =0;  %  counters 

for  i  =  1 : Mw 

for  j=l : Nw 

sum_x  =  sum_x  +  W ( i , j ) *Wr ( i , j ) ; 
s  um  =  s  um  +  W ( i , j ) ^  2 ; 
sum_r  =  sum_r  +  Wr(i,j)^2; 

end 

end 

rho  =  sum_x/ ( sum*sum_r ) ^ ( 1 /2 ) ; 


is 
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function  [I]  =  ibitPlanes ( Ip) 


%  loannis  Retsas 
%  Aug  17,  2001 

%  LAST  MODIFICATION:  Sep  4,  2001 
%  FUNCTION:  ibitPlanes 

%  INPUT:  A  binary  matrix  where  the  first  two  dimensions  are  the  actual 
%  dimensions  of  the  input  image,  while  the  third  dimension  represents  the 
%  different  planes,  each  containing  equally  significant  bits  of  the  binary 
%  representation  of  the  value  of  each  pixel.  Plane  1  (k=l)  contains  the  most 

%  significant  bits. 

%  DESCRIPTION:  Performs  the  reverse  process  of  bitPlanes  function. 

%  Receives  an  image  that  has  been  decomposed  into  binary  planes  of  equally 
%  significant  bits,  and  returns  the  original  image. 

%  RETURNS:  A  grayscale  Image. 

^•>'r*'>'r*'>'r*':ir*^*7'r*'>'r**'*^***'*^*'*^*****'*^*'*^*'*^***'*^***'*^***'*^*'*^*7'r*'>'r*':ir*'>'r*7'r*'>'r*^*'>'r*'>'r*'^ 

%  Ip  =  uintS (round (Ip) ) ;  %  PROSOXH 
[L, M, N]  =  size (Ip) ; 

for  i  =  1 :  L 

for  j  =  1 :  M 

C  =  Ip  (i,  j,  1)  ; 
for  k  =  1 : N-1 

C  =  bitshift  (C, 1 ) ; 

C  =  bitor (C, Ip (i , j , k+1 ) ) ; 

end 

I  (i,  j)  =  C; 

end 

end 
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function  s=SNR(M,Md) 

%  loannis  Retsas 
%  Nov  16,  2001 
%  LAST  MODIFICATION: - 
%  FUNCTION:  SNR 

%  INPUT:  The  original  matrix  M,  and  the  distorted  matrix  Md,  expressed  in  a 
%  [0  255]  scale. 

%  DESCRIPTION:  We  calcuated  the  SNR  based  on  the  formula: 

%  SNR  =  lOlog (sigma^2/mse) ,  where  mse  =  E[(x-xd)^2]  =  (1/MN)SUM  SUM  (x-xd) ^2 
%  is  the  mean  square  error,  as  presented  in  introduction  to  Data  Compression  by 
%  Khalid  Sayood  (2nd  edition),  2000. 

%  RETURNS :  The  SNR,  s . 

M  =  double (M) ;  Md  =  double (Md) ; 
mean_M  =  mean (mean (M) ) ; 
sigmaSquare  =  0;  mse  =  0; 

%  calculation  of  sigma,  mse 
[K,  L]  =  size  (M)  ; 
for  i  =  1  :  K 

for  j  =  1  :  L 

sigmaSquare  =  sigmaSquare  +  (M (i , j ) -mean_M) ^2 ; 
mse  =  mse  +  (M(i,j)  -  Md(i,j))^2; 

end 

end 

sigmaSquare  =  sigmaSquare/ (K*L) ; 
mse  =  mse/ (K*L) ; 

s  =  10*logl0 (sigmaSquare/mse) ;  % 
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function  [V]  =  zigzag (Mi) 

%  loannis  Retsas 
%  September  12,  2001 
%  LAST  MODIFIED:  - 
%  FUNCTION:  zigzag (Mi) 

%  INPUT:  An  input  matrix  Mi. 

%  DESCRIPTION:  - 

%  RETURNS:  A  row  vector  V  that  contains  the  elements  arranged  in  zigzag 
%  fashion. 


A  =  [  1 

2 

6 

7 

15 

16 

28 

29 

3 

5 

8 

14 

17 

27 

30 

43 

4 

9 

13 

18 

26 

31 

42 

44 

10 

12 

19 

25 

32 

41 

45 

54 

11 

20 

24 

33 

40 

46 

53 

55 

21 

23 

34 

39 

47 

52 

56 

61 

22 

35 

38 

48 

51 

57 

60 

62 

36 

37 

49 

50 

58 

59 

63 

64 

V(A(  :  )  ) 

,  _ 

Mi 

(:)  ; 
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function  [Mo] 


zzRvs (V) 


%  loannis  Retsas 
%  September  12,  2001 
%  LAST  MODIFIED:  - 
%  FUNCTION:  zzRvs(Mi) 

%  INPUT:  An  input  vector  V. 

%  DESCRIPTION:  - 

%  RETURNS:  A  matrix  Mo  that  has  the  elements  of  V  arranged  in  a  zigzag  fashion. 


1 

2 

6 

7 

15 

16 

28 

29; 

3 

5 

8 

14 

17 

27 

30 

43; 

4 

9 

13 

18 

26 

31 

42 

44; 

10 

12 

19 

25 

32 

41 

45 

54; 

11 

20 

24 

33 

40 

46 

53 

55; 

21 

23 

34 

39 

47 

52 

56 

61; 

22 

35 

38 

48 

51 

57 

60 

62; 

36 

37 

49 

50 

58 

59 

63 

64]  ; 

Mo  =  V (A ( : ) ) ; 

Mo  =  reshape (Mo, 8,8); 
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function  [W] 


NFS 


%  loannis  Retsas 
%  Nov  19,  2001 
%  FUNCTION:  NFS 
%  INFUT:  - 
%  DESCRIFTION: 

%  RETURNS:  Returns  a  gray  scale  watermark  W,  64x64,  with  the  NFS  logo 
%  comprised  by  blocks  of  different  (random)  grayscale  level; 

%  REMARK:  Each  element  of  the  matrix  is  an  uintS.  Whether  it  will  be  color 
%  or  gray  depends  on  the  function  you  are  using. 

%  imagesc()  gives  the  colored  representation,  while 
%  imagescO,  colormap (gray)  gives  the  gray  scale  one. 

%  background 

W  =  round ( 255*rand ( 64 , 64 ) ) ; 

W(ll:50,:)  =  200*ones  (40,  64)  ; 

%  N 

W(17:42,5:8)  =  round (255*rand (2 6 , 4 ) ) ; 

W(19:20,9)  =  round(255*rand(2, 1) ) ; 

W(21:22, 9:10)  =  round(255*rand(2,2)); 

W(23:24, 9:11)  =  round(255*rand(2,3)); 

W(25:26, 9:12)  =  round(255*rand(2,4)); 

W(27:28, 10:13)  =  round (255*rand (2, 4 ) ) ; 

W(29:30, 11:14)  =  round (255*rand (2 , 4 ) ) ; 

W(31:32, 12:15)  =  round (255*rand (2 , 4 ) ) ; 

W(33:34, 13:16)  =  round (255*rand (2 , 4 ) ) ; 

W(35:36, 14:16)  =  round (255*rand (2, 3) ) ; 

W(37:38,  15:16)  =  round (255*rand  (2, 2) ) ; 

W(39:40,16)  =  round(255*rand(2,l)); 

W  (17 : 42,  17  :20)  =  round (255*rand (2 6 , 4 ) ) ; 

%  F 

W(17:42, 25:28)  =  round(255*rand(26,4)); 

W(17:20, 29:32)  =  round (255*rand ( 4 , 4 ) ) ; 

W(18:21, 33:34)  =  round (255*rand ( 4 , 2 ) ) ; 

W(20:23, 35:36)  =  round (255*rand ( 4 , 2 ) ) ; 

W(22:26, 37:38)  =  round (255*rand ( 5 , 2 ) ) ; 

W(25:28, 35:36)  =  round (255*rand  ( 4 , 2 ) ) ; 

W(27:30, 33:34)  =  round (255*rand ( 4 , 2 ) ) ; 


109 


W(28:31, 29:32) 
%  S 

W(19:22, 55:56) 
W(18:21, 53:54) 
W (17 :20,  49:  52) 
W  (18  :21,  47  :  48) 
W(20:23, 45:46) 
W(22  :26,  43  :  44) 
W(25:28, 45:46) 
W (27 : 30,  47  :  48) 
W(28:31, 49:52) 
W(29:32, 53:54) 
W(31:34, 55:56) 
W(33:37, 57:58) 
W(36:39, 55:56) 
W(38:41, 53:54) 
W(39:42, 49:52) 
W  (38 : 41,  47  :  48) 
W(36:39, 45:46) 


W  =  uint8 (W) ; 


round (255*rand  (4,4)); 

round(255*rand(4,2) ) ; 
round(255*rand(4,2) ) ; 
round (255*rand (4,4)); 
round(255*rand(4,2) ) ; 
round (255*rand ( 4 , 2 ) )  ; 
round(255*rand(5,2)  )  ; 
round(255*rand(4,2)  )  ; 
round(255*rand(4,2)  )  ; 
round (255*rand  (4,4)); 
round(255*rand(4,2) ) ; 
round(255*rand(4,2) ) ; 
round(255*rand(5,2) ) ; 
round(255*rand(4,2) ) ; 
round(255*rand(4,2) )  ; 
round (255*rand (4,4)); 
round(255*rand(4,2)  )  ; 
round (255*rand(4,2)  )  ; 


no 


function  [W]  =  randWm(M,N) 

%  loannis  Retsas 
%  Nov  25,  2001 
%  FUNCTION:  randWm 

%  INPUT:  The  dimensions  M,  N  of  the  watermark. 

%  DESCRIPTION: 

%  RETURNS:  Returns  a  gray  scale  random  watermark 

W  =  uint8 (round (255*rand (M, N) )) ; 


■  (MxN)  . 


Ill 


function  [W]  =  stripes (M,N) 


%  loannis  Retsas 
%  Aug,  2001 
%  FUNCTION:  stripes 

%  INPUT:  The  dimensions  M,  N  of  the  watermark. 

%  DESCRIPTION: 

%  RETURNS:  Returns  a  gray  scale  watermark  W;  11  vertical  stripes  with  the 
%  the  value  of  the  gray  scale  be  maximum  in  the  middle  stripe. 

%  REMARK:  Each  element  of  the  matrix  is  an  uintS.  Whether  it  will  be  color 
%  or  gray  depends  on  the  function  you  are  using. 

%  imagesc()  gives  the  colored  representation,  wlile 
%  imagescO,  colormap (gray)  gives  the  gray  scale  one. 


width 

=  round(N/ll 

) ; 

W(1 :M, 

1 : width)  = 

0  *  ones  (M 

, width) ; 

W (1 :M, 

width+1  :  2 

*width)  = 

50  * 

ones (M, width) ; 

W (1 :M, 

2*width+l  : 

3*width) 

=  100 

ones (M, width) 

W(1 :M, 

3*width+l  : 

4  *width) 

=  150 

ones (M, width) 

W (1 :M, 

4*width+l  : 

5*width) 

=  200 

ones (M, width) 

W (1 :M, 

5*width+l  : 

6*width) 

=  250 

* 

ones (M, width) 

W (1 :M, 

6*width+l  : 

7  *width) 

=  225 

ones (M, width) 

W (1 :M, 

7*width+l  : 

8*width) 

=  175 

ones (M, width) 

W (1 :M, 

8*width+l  : 

9*width) 

=  125 

ones (M, width) 

W (1 :M, 

9*width+l  : 

10*width) 

=  75 

ones (M, width) 

W (1 :M, 

10*width+l 

:  N)  =  25 

*  ones (M,  N-10*width) 

W  =  uint8 (W) ; 
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